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NOVEL COMPOUNDS 

Tlrid of the Invention: 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 
S production of such polynucleotides and polypeptides and recombinant host cells 
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae, 

10 Background of the Invention; 

The Streptococci make up a medically important genera of microbes known to 
cause several types of disease in humans, including otitis media, pneumonia and meningitis. 
Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the 
more intensively studied microbes. For example, much of our early understanding that 

15 DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery, 
Macleod and McCarty using this microbe. Despite the vast amount of research with S. 
pneumoniae, many questions concerning the virulence of this microbe remair *^ " 
While certain Streptococcal factors associated with pathogenicity have been 
identified, e.g., capsule polysaccharides, peptidoglycans. pneumolyses, PspA Complement 

20 factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen 
peroxide, IgAl protease, the list is certainly not complete. Further very little is known 
concerning the temporal expression of such genes during infection and disease progression 
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing 
at the different stages of infection, particularly when an infection is established, provides 

25 critical information for the screening and characterization of novel antibacterials which can 
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins, 
such an approach will identify previously unrecognised targets. 
Brief Description of the Invention: 

This invention provides novel proteins, particularly those from Streptococcus 

30 pneumoniae, strain 0100993, characterised in that it comprises the amino acid sequences 
given herein or fragments, analogues or derivatives thereof. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 
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In particular the invention provides polynucleotides having the DNA sequences 
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
5 whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is 
expressed in infected tissue. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. The 
proteins so identified are also useful as targets in screens designed to identify antimicrobial 
compounds. 

10 It is an object of the invention to provide polypeptides that have been identified as novel 

polypeptides by homology between the amino acid sequence set out in the Sequence Listing and 
a known amino acid sequence or sequences of other proteins such as the proteins identified under 
the heading Identity in Table 1. 

It is a further object of the invention to provide polynucleotides that encode polypeptides 

15 set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set 
forth in the Sequence Listing. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

20 In another particularly preferred embodiment of the invention there is a novel 

protein from comprising an amino acid sequence of the Sequence Listing, or a variant 
thereof. 

In accordance with another aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 

25 genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants the polynucleotides set forth in the Sequence Listing and 
polypeptides encoded thereby. 

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

30 In accordance with certain preferred embodiments of the invention, there are provided 

products, compositions and methods for assessing expression of the sequences the Sequence 
Listing, treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia, 
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis, 
such as for example infection of cerebrospinal fluid, assaying genetic variation, and 
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administering a polypeptide or polynucleotide of the invention to an organism to raise an 
immunological response against a bacteria, especially a Streptococcus pneumoniae bacteria. 

In accordance with certain preferred mbodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the 
5 invention, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
1 0 polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or 

polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide to 
assess the binding to or other interaction with the compound, such binding or interaction being 
associated with a second component capable of providing a detectable signal in response to the 
1 5 binding or interaction of the polypeptide or polynucleotide with the compound; and determining 
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of 
the polypetide or polynucleotide by detecting the presence or abse' « of a signal generated from 
the binding or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided agonists and 
20 antagonists of the polynucleotide and/or polypeptides of the invention, preferably bacteriostatic 
or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a 
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular 
organism. 

25 Various changes and modifications within the spirit and scope of the disclosed invention 

will become readily apparent to those skilled in the ait from reading the following descriptions 
and from reading the other parts of the present disclosure. 
Detailed Description of the Invention: 

Each of the DNA sequences provided herein may be used in the discovery and 

30 development of antibacterial compounds. Because each of the sequences contains an open 
reading frame (ORF) with appropriate initiation and termination codons, the encoded 
protein upon expression can be used as a target for the screening of antimicrobial drugs. 
Additionally, the DNA sequences encoding preferably the amino terminal regions of the 
encoded protein or the Shine-Delgarno region can be used to construct antisense sequences 

3 
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to control the expression of the coding sequence of interest. Furthermore, many of the 
sequences disclosed herein also provide regions upstream and downstream from the 
encoding sequence. These sequences are useful as a source f regulatory elements for the 
control of bacterial gen expression. Such sequences are conveniently isolated by 
5 restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence 
located downstream from a restriction site such that if an active promoter is inserted, the 
reporter gene will be expressed. 

Although each of the sequences may be employed as described above, this 

10 invention also provides several means for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene 
would likely play an analogous role. For example, a Streptococcal protein identified as 
homologous to a cell surface protein in another organism would be useful as a vaccine 

15 candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the encoding sequence. 

Recently techniques have become available to evaluate temporal gene expression in 
bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se t or 

20 essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
these approaches include: 
1) Signature Tagged Mutagenesis (STM) 

25 This technique is described by Hensel fit al., Science 269: 400-403( 1 995), the 

contents of which is incorporated by reference for background purposes. 
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance 
of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various 

30 means {e.g., transposons) such that unique DNA sequence tags are inserted in close 

proximity to the site of mutation. The tags from a mixed population of bacterial mutants 
and bacteria recovered from an infected hosts are detected by amplification, radiolabeling 
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the 
tag from the pool of bacteria recovered from infected hosts. 
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In Streptococc us pneumoniae, because the transposon system is less well 
developed, a more efficient way of creating the tagged mutants is to use the insertion- 
duplication mutagenesis technique as described by Morrison et al.. J. Bacteriol. 159:870 
(2984) the contents of which is incorporated by reference for background purposes. 
5 2) In Vivo Expression Technology (IVET) 

This technique is described by Camilli & al., Proc . Natl . Acad . Sci . USA . 91 :2634- 
2638 (1994), the contents of which is incorporated by reference for background purposes. 
IVET identifies genes up-regulated during infection when compared to laboratory 
cultivation, implying an important role in infection. ORF identified by this technique are 
10 implied to have a significant role in infection establishment/maintenance. 

In this technique random chromosomal fragments of target organism are cloned 
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is 
introduced into the target organism which carries an antibiotic resistance gene flanked by. 
resolvase sites. Growth in the presence of the antibiotic removes from the population those 
15 fragments cloned into the plasmid vector capable of supporting transcription of the 

recombinase gene and therefore have caused loss of antibiotic resistance. The resistant pool 
is introduced into a host and at various times a' er infection bacteria may be recovered and 
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by 
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally 
20 upregulated during infection. Sequencing upstream of the recombinase gene allows 
identification of the up regulated gene. 
3) Differential display 

This technique is described by Chuang el al., J. Bactenql . 175:2026-2036 (1993), 
the contents of which is incorporated by reference for background purposes. This method 
25 identifies those genes which are expressed in an organism by identifying mRNA present 
using randomly-primed RT-PCR. By comparing pre-infection and post infection profiles, 
genes up and down regulated during infection can be identified and the RT-PCR product 
sequenced and matched to ORF 'unknowns'. 



5 
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4) Generation of conditi nal lethal mutants by transposon mutagenesis. 

This technique, described by de Lorenzo, V. ct al. t Gene 123:17-24 (1993); 
Neuwald, A. F. filflL, Qcn£ 125: 69-73(1993); and Takiff. H. E.filaL J. Bacteriol . 
174:1544-1553(1992), the contents of which is incorporated by reference for 
5 background purposes, identifies genes whose expression are essential for cell viability. 

In this technique transposons carrying controllable promoters, which provide 
transcription outward from the transposon in one or both directions, are generated. Random 
insertion of these transposons into target organisms and subsequent isolation of insertion 
mutants in the presence of inducer of promoter activity ensures that insertions which 

10 separate promoter from coding region of a gene whose expression is essential for cell 

viability will be recovered. Subsequent replica plating in the absence of inducer identifies 
such insertions, since they fail to survive. Sequencing of the flanking regions of the 
transposon allows identification of site of insertion and identification of the gene disrupted. 
Close monitoring of the changes in cellular processes/morphology during growth in the 

15 absence of inducer yields information on likely function of the gene. Such monitoring 
could include flow cytometry (cell division, lysis, redox potential, DNA replication), 
incorporation of radiochemical^ labeled precursors into DNA, RNA, protein, lipid, 
peptidoglycan, monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

20 5) Generation of conditional lethal mutants by chemical mutagenesis. 

This technique is described by Beckwith, J.. Methods in Enzvmologv 204: 
3-18(1991), the contents of which are incorporated herein by reference for background 
purposes. In this technique random chemical mutagenesis of target organism, growth at 
temperature other than physiological temperature (permissive temperature) and subsequent 

25 replica plating and growth at different temperature (e.g. 42°C to identify ts, 25°C to identify 
cs) are used to identify those isolates which now fail to grow (conditional mutants). As 
above close monitoring of the changes upon growth at the non-permissive temperature 
yields information on the function of the mutated gene. Complementation of conditional 
lethal mutation by library from target organism and sequencing of complementing gene 

30 allows matching with unknown ORF. 
6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue e.g. 48 
hour murine lung infections, and the amount of each mRNA species assessed by reverse 
transcription of the RNA sample primed with random hexanucleotides followed by PCR 
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with gene specific primer pairs. The determination of the presence and amount of a 
particular mRNA species by quantification of the resultant PCR product provides 
information on the bacterial genes which are transcribed in the infected tissue. Analysis of 
gene transcription can be carried out at different times of infection to gain a detailed 
5 knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding 
of which gene products represent targets for screens for novel antibacterials. Because of 
the gene specific nature of the PCR primers employed it should be understood that the 
bacterial mRNA preparation need not be free of mammalian RNA. This allows the 
investigator to carry out a simple and quick RNA preparation from infected tissue to 
10 obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2 
minute halflives). Optimally the bacterial mRNA is prepared from infected murine lung 
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short 
periods of time, subsequent processing according to the manufacturers of TRIzole reagent 
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised 
15 by finding those conditions which give a maximum amount of S. pneumoniae 16S 

ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific 
oligonucleotide probe. Typir lly a 5' dye labelled primer is used in each PCR primer pair 
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR 
products are separated on 6% polyacrylamide gels with detection and quantification using 
20 GeneScanner (manufactured by ABI). 

Each of these techniques may have advantages or disadvantage depending on the 
particular application. The skilled artisan would choose the approach that is the most 
relevant with the particular end use in mind. For example, some genes might be recognised 
as essential for infection but in reality are only necessary for the initiation of infection and 
so their products would represent relatively unattractive targets for antibacterials developed 
to cure established and chronic infections. 

Use of the of these technologies when applied to the ORFs of the present invention 
enables identification of bacterial proteins expressed during infection, inhibitors of which 
would have utility in anti-bacterial therapy. 

Streptococcus pneumoniae, strain 0100993 has been deposited at the National 
Collection of Industrial and Marine Bacteria Ltd. (NCIMB), Aberdeen. Scotland under 
NCIMB number 40794 on 1 1 April 1996. and a Streptococcus pneumoniae, strain 0100993 
DNA library in E. coli was similarly deposited on 17 April 1996 under NCIMB number 
40800. 



25 
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The nucleotide sequences disclosed herein can be obtained by synthetic chemical 
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain 
01O0993by probing a DNA preparation with probes constructed from the particular 
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed 

5 sequence can act as PCR primers in a process of PCR-based cloning of the sequence from a 
bacterial genomic source. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given 
herein typically a library of clones of chromosomal DNA of Streptococcus pneumoniae. 

10 strain 0100993 in E. coli or some other suitable host is probed with a radiolabelled 

oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using high stringency 
washes. By sequencing the individual clones thus identified with sequencing primers 
designed from the original sequence it is then possible to extend the sequence in both 

15 directions to determine the full gene sequence. Conveniently such sequencing is performed 
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques 
are described by Maniatis, T., Fritsch, E.F. and Sambrook f J. in MOLECULAR CLONING, 
A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening 
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 

20 13.70). 

A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 
25 may be identical to the coding sequence shown or may be a different coding sequence 
which coding sequence, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 
30 characterized by the deduced amino acid sequence given herein. The variant of the 

polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, TAJ 
representations for nucleic acid bases, the term "N" is also used. "N" means that any of the 
four DNA or RNA bases may appear at such a designated position in the DNA or RNA 

8 
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sequence, except that N cannot be a base that when taken in combination with adjacent 
nucl otide positions, when read in the correct reading frame, would have the effect of 
generating a premature terminati n codon in such reading frame. 

Thus, the present invention includes polynucl otides encoding the same 
5 polypeptide characterized by the deduced amino acid sequence given herein as well as 
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
10 allelic variant of the coding sequence characterized by the DNA sequence disclosed herein. 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, may include only 
1 5 the coding sequence for the mature polypeptide or the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory sequence or a 
proprotein s'. ^uence. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
20 polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
25 controlling transport of a polypeptide from the cell. The polypeptide having a leader 

sequence is a prcprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and may be an inactive form of the protein. Once the 
prosequence is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may code for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a prcsequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH r terminus. It is appreciated, however, 



30 
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that during post-translational modification of the peptide, this residue may be deleted. 
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence at either the 5* or 3 ( terminus of the gene which allows 
for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Quiagen 
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a 
bacteria] host. 

The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 50% and preferably at least 70% identity 
between the sequences. The present invention particularly relates to polynucleotides, 
particularly Streptococcal polynucleotides, which hybridize under stringent conditions to 
the hereinabove-described polynucleotides . As herein used, the term "stringent conditions" 
means hybridization will occur only if there is at least 95% and preferably at least 97% 
identity between the sequences. The polynucleotides which hybridize to the hereinabove 
described polynucleotides in a preferred embodiment encode polypeptides w' xh retain 
substantially the same biological function or activity as the polypeptide characterised by the 
deduced amino acid sequence given herein. An example of stringent hybridization 
conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x 
SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x 
Denhardt's solution* 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon 
sperm DNA, followed by washing the hybridization support in O.lx SSC at about 65°C. 
Hybridization and wash conditions are well known and exemplified in Sambrook, et a/.. 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., 
(1989), particularly Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 
polynucleotides of the Sequence Listing under stringent hybridization conditions with a 
probe having the sequence of said polynucleotide sequence set forth in said polynucleotide 
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 

10 
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"Identity," as known in the an and used herein, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
5 between strings of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in {Computational Molecular 
Biology, Lesk. AM., ed.. Oxford University Press. New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith. D.W., ed.. Academic Press. New York. 1993; 
Computer Analysis of Sequence Data, Part I. Griffin, A.M.. and Griffin. H.G., eds., 
10 HumanaPress. New Jersey. 1994; Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov. M. and Devereux, J., eds.. 
M Stockton Press, New York. 1991; and Carillo. H., and Lipman, D.. SIAM / Applied 
Math.. 48: 1073 (1988). Preferred methods to determine identity are designed to give the 
largest match between the sequences tested. Methods to determine identity and similarity 
15 are codified in publicly available computer programs. Preferred computer program methods 
to determine identity and similarity between two sequences include, but are not limited to, 
the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)), 
BLASTP, BLASTN, and FASTA (Atschul, S.F. et al.. J. Molec. Biol. 215: 403-410 (1990). 
The BLAST X program is publicly available from NCBI and other sources {BLAST 
20 Manual, Altschul, S.. et al, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. 
Mol. Biol. 215: 403-410 (1990). 

As an illustration, by a polynucleotide having a nucleotide sequence having at least, 
for example, 95% "identity" to a reference nucleotide sequence selected from the group 
consisting of the polynucleotide of the Sequence Listing is intended that the nucleotide 
sequence of the polynucleotide is identical to the reference sequence except that the 
polynucleotide sequence may include up to five point mutations per each 100 nucleotides of 
the reference nucleotide sequence. In other words, to obtain a polynucleotide haying a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 
the nucleotides in the reference sequence may be deleted or substituted with another 
nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutations of the reference 
sequencemayoccurattheS or3 terminal positions of the reference nucleotide sequence 
or anywhere between those terminal positions, interspersed either individually among 
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nucleotides in ih reference sequence or in one or more contiguous groups within the 
reference sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, 
for example, 95% identity to a reference amino acid sequence of selected from the 

5 group consisting of the amino acids of the Sequence Listing is intended that the 
amino acid sequence of the polypeptide is identical to the reference sequence except 
that the polypeptide sequence may include up to five amino acid alterations per 
each 100 amino acids of the reference amino acid. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference 

10 amino acid sequence, up to 5% of the amino acid residues in the reference sequence 
may be deleted or substituted with another amino acid, or a number of amino acids 
up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or 

15 anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 

20 Patent Procedure. These deposits are provided merely as convenience to those of skill in 
the art and are not an admission that a deposit is required under 35 U.S.C. §1 12. The 
sequence of the polynucleotides contained in the deposited material, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and 
are controlling in the event of any conflict with any description of sequences herein. A 

25 license may be required to make, use or sell the deposited material, and no such license is 
hereby granted. 

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, means a polypeptide 
which retains essentially the same biological function or activity as such polypeptide. 
30 Thus, an analogue includes a proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 
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The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) on in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be 
5 one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
10 purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues arc deemed to be within the scope of those skilled in the art from the teachings 
herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 
15 The term "isolated" means that the material is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occuning polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
20 and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 
25 In accordance with yet a further aspect of the present invention, there is therefore 

provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 
30 Host cells are genetically engineered (transduced or transformed or transfected) 

with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 

13 



WO 97/37026 PCT/US97/05306 
g nes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to th ordinarily skilled 
artisan. 

Suitabl expression vectors include chromosomal, nonchromosomal and synthetic 
5 DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 

10 endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E, coli. 
lac or trp, the phage lambda promoter and other promoters known to control expression 

15 of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also 

contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequer - s for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 

20 dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline orampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as "control 11 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 

25 in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacterial host in post-translational processing. See, e.g., U.S. Patent Nos. 4,431,739; 

30 4,425,437; 4,338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda Pr, P^ and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
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mous raetallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
5 of the host cell. Regulatory sequences are known to those of skill in the art. and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

10 An expression vector is constructed so that the particular coding sequence is located 

in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is transcribed under the "control" of the control sequences (i.e.. RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 
1 5 sequence). Modification of the coding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 
20 above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene off. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
25 expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 
identification peptide imparting desired characteristics, e.g.. stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 

15 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 

5 embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, 

10 pNH8A, pNH16a, pNHl 8A. pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3. 

pDR540, pRITS (Pharmacia). Eukaryotic: pBlueBacffl (Invitrogen), pWLNEO, pSV2CAT, 
pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 

1 5 Examples of recombinant DNA vectors for cloning and host cells which they can 

transform include the bacteriophage 1 (£. coli). pBR322 (£. coll), pACYC177 (£. coli) t 
pKT230 (gram-negativr bacteria), pGVl 106 (gram-negative bacteria), pLAFRl (gram- 
negative bacteria), pME290 (non-£. coli gram-negative bacteria), pHV14 (£. coli and 
Bacillus subtilis), pBD9 (Bacillus), plJ61 (Streptomyces), pUC6 (Streptomyces), YIp5 

20 (Saccharomyces), a baculovirus insect cell system, YCpl9 (Saccharomyces). See, 

generally, M DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et aL ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

25 Polypeptides can be expressed in host cells under the control of appropriate 

promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et al. t Molecular Cloning: A Laboratory Manual, Second Edition, Cold 

30 Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
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Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
conv nient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
5 of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
10 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression. it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
25 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e.. capable of replication 
under its own control. 

A "vector" is a replicon. such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
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A "double-stranded DNA molecule" refers to the polymeric form of 
de xyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 
restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
mRNA). 

A DNA "coding sequence of or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3* direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3' 't minus by a translation start codon (e.g., ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary to 
initiate transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in 
addition to the -10 and -35 consensus sequences. 

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 
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A eel] has been "transformed" by xogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
5 episomaJ element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cell containing the exogenous DNA. 
10 A "clone" is a population of cells derived from a single cell or common ancestor by 

mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
15 molecule in nature. 

TOs invention is also related to the use of the polynucleotides of the invention for use as 
diagnostic agents. Detection of a polynucleotide or polypeptide o' the invention in a 
eukaryote. particularly a mammal, and especially a human, will provide a diagnostic method for 
diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly mammals and 
20 especially humans, infected with an organism comprising a polynucleot.de of the invention may 
be detected at the nucleic acid level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual s cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or other amplification technique prior 
25 to analysis. RNA or cDNA may also be used in the same ways. Using amplification, 
characterization of the species and strain of prokaryote present in an individual, may be made by 
an analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the genotype of a reference sequence 
Point mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 
'0 sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melting tempemtures. DNA sequence 
differences may also be detected by alterations in the electrophone mobility of the DNA 
figments in gels, with or without denaturing agents, or by direct DNA sequencing See eg. 
Myers et al., Science. 230: 1242 (1985). Sequence changes at specific locations also may bJ 
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revealed by nuclease protection assays, such as RNase and SI protection or a chemical cleavage 

method. See, eg., Cotton et al., Proc. Natl Acad. ScL. USA, 85: 4397-4401 

(1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be 
detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For 
example, RT-PCR can be used to detect mutations. It is particularly preferred to used RT-PCR 
in conjunction with automated detection systems, such as, for example, GeneScan. RNA or 
cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify 
and analyze mutations. These primers may be used for, among othe4r things, amplifying DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene 
isolated from an infected individual such that the gene may then be subject to various techniques 
for elucidation of the DNA sequence. In this way, mutations in the DNA sequence may be 
detected and used to diagnose infection and to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably infections by Streptococcus pneumoniae, comprising determining 
from a sample derived from an individual a increased level of expression of polynucleotide 
having a sequence set forth in the Sequence Listing or a sequence of the invention. 
Increased or decreased expression of a polynucleotide of the invention can be measured 
using any on of the methods well known in the art for the quantation of polynucleotides, 
such as, for example, amplification, PCR, RT-PCR, RNase protection, Northern blotting 
and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of a polypeptide of the invention compared to normal control tissue samples may be 
used to detect the presence of an infection, for example. Assay techniques that can be used to 
determine levels of a polypeptide of the invention, in a sample derived from a host are well- 
known to those of skill in the art. Such assay methods include radioimmunoassays, competitive- 
binding assays, Western Blot analysis and ELISA assays. 

In accordance with yet a further aspect of the present invention, there is provided 
I the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation. 

20 



WO 97/37026 PCT/US97/D5306 
In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule 
5 substrates and ligands in. for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands may be natural substrates and ligands or may be 
structural or functional mimetics. See. e.g., Coligan a ai. Current Protocols in Immunology 
/(2): Chapter 5 (1991). 

The invention also provides a method of screening compounds to identify those which 
10 enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the 
invention, particularly those compounds that are bacteriostatic and/or bacteriocidal. The method 
of screening may involve high-throughput techniques. For example, to screen for agonists or 
antagoists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope 
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a 
15 labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a 
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The 
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is 
reflected in decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously, Le., without inducing the effects of a polypeptide of 
20 the invention are most likely to be good antagonists. Molecules that bind well and increase the 
rate of product production from substrate are agonists. Detection of the rate or level of production 
of product from substrate may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colorimetric labeled substrate 
converted into product, a reporter gene that is responsive to changes in polynucleotide or 
25 polypeptide activity, and binding assays known in the art. 

Another example of an assay for antagonists of a polypeptide of the invention is a 
competitive assay that combines such polypeptide and a potential antagonist with polypeptide- 
binding molecules, recombinant polypeptide-binding molecules, natural substrates or ligands, or 
substrate or ligand mimetics. under appropriate conditions for a competitive inhibition assay. 
30 Polypeptides of the invention can be labeled, such as by radioactivity or a colorimetric 
compound, such that the number of such polypeptide molecules bound to a binding molecule or 
converted to product can be determined accurately to assess the effectiveness of the potential 
antagonist. 
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Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
5 molecule, without inducing an activities of a polypeptide of the invention, thereby preventing the 
action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
10 to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, / Neurochem 56: 560 (1991); 
OUGODEOXYNUCLEOT1DES AS ANTISENSE INHIBITORS OF GENE EXPRESSION CRC 
Press, Boca Raton, FL (1988), for a description of these molecules).. Preferred potential 
antagonists include compounds related to and variants of the polypetides of the invention. 
15 Each of the DNA sequences provided herein may be used in the discovery and 

development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
20 sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
25 bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion 
by, for example, initiating phosphorylation of mammalian tyrosine kinases (Rosenshine et 
a/.. Infect. Immun. 60:2211 (1992); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial proteins that mediate tissue damage and; to block 
30 the normal progression of pathogenesis in infections initiated other than by the implantation 
of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural 
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empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutically 
5 acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
10 specific antibodies which have anti-bacterial action. This invention also contemplates the 
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 
15 antibodies. The term antibodies also includes chimeric, single chain, and humanized 

antibodies, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be .,sed for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be 
20 obtained by direct injection of the polypeptides into an animal or by administering the 

polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing thai 
25 polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
30 which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative' as used herein encompasses a 
peptide or its equivalent which when used in a suitable formulation to raise antibodies in a 
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vertebrate, the antibodies act to interfere with the interaction between pathogen and 
mammalian host. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
5 polypeptides in which one or more of the amino acid residues are modified may be used. 
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
10 or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation , with an immunogenic carrier 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
15 polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Miistein, Nature . 256:495-497(1975)), the trioma technique, the 
20 human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72(1983)), and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
25 products of this invention. 

Using the procedure of Kohler and Miistein (aipia, ( 1 975)), antibody-containing 
cells from the immunised mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 

The hybridomas are screened to select a cell line with high binding affinity and 
30 favorable cross reaction with other Streptococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 
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Alternatively phage display technology could be utilised to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes f lymphocytes from humans screened for possessing ami-Fbp or from naive libraries 
(McCafferty, J. et ai, £iatlU£ 348:552-554(1990), and Marks. J. etai. Biotechnology 
5 10:779-783(1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. eial.. Nature 352:624-628(1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 
10 The antibody may be either intact antibody of M r approx 150,000 or a derivative of 

it, for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun. 
A., Scjfinse 240: 1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed 'bispecific' antibodies. 

The antibody of the invention may be prepared by conventional means for example 
15 by established monoclonal antibody technology (Kohler, G. and Milstein, C. (supra . (1975) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse. 
W.D. et at., Sciejicc ' 46:1275-1281 (1989). 

Preferably tne antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
20 polypeptides of the invention. The choice of vector for the expression system will be 

determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse CI 27, mouse myeloma, 
human HeLa. Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt, A. etal., Uaau& 340:76-78(1989). Suitable vectors include plasmids, bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

The Fab fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 

Preferably the antibody or derivative thereof is modified to make it less 
immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be •humanised'; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for 
example as described in Jones. P. et aL, Mature_32 1:522-525 (1986), or Tempest et al., 
Bjoigcimology. 9:266-273 (1991). 
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The modification need not be restricted to ne of 'humanisation' ; other primate 
sequences (for example Newman, R. et a/., Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The humanised monoclonal antibody, or its fragment having binding activity, form 
a particular aspect of this invention. 

This invention provides a method of screening drugs to identify those which 
interfere with the proteins selected as targets herein, which method comprises measuring the 
interference of the activity of the protein by a test drug. For example if the protein selected 
has a catalytic activity, after suitable purification and formulation the activity of the enzyme 
can be followed by its ability to convert its natural substrates. By incorporating different 
chemically synthesised test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

The invention also relates to inhibitors identified thereby. 
The use of a polynucleotide of the invention in genetic immunisation will 
preferably employ a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et a/.. Hum Mol. Genet. 1:363 (1992); Manthorpe et a/., Hum. Gf %J3aSL- 
4:419 (1963)), delivery of DNA complexed with specific protein carriers ( Wu et ai, L 
ftiol. Chem . 264:16985 (1989)), coprecipitation of DNA with calcium phosphate 
(Benvenisty & Reshef, Proc Natl Aca d Sci USA. 83:9551 (1986)), encapsulation of DNA 
in various forms of liposomes (Kaneda et a/., ScklKfi 243:375 (1989)), particle 
bombardment (Tang et al> lialuis 356:152 (1992)); Eisenbraun et aU PNA Cgll Piftl 
12:791 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et a/., Era* 
Nat'l. Acad. Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include 
CMV, RSV, SRa, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application 
for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drag 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
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or oleyl alcohol for lotions. Such carriers may constitute from about 1% to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the f rmulation. 

For administration to human patients, it is expected that the daily dosage level of 
5 the active agent will be from 0.01 to 10 mg/kg, typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be most suitable for an individual 
patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can. of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
10 invention. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5 ug/kg of antigen, and such dose is 
preferably administered 1-3 times and with an interval of 1-3 weeks. 

Within the indicated dosage range, no adverse toxicologicals effects are expected 
with the compounds of the invention which would preclude their administration to suitable 
pa* ents. 

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 
20 "Plasmids" are designated by a lower case p preceded and/or followed by capital 

letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 M l of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 M g of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour at 37° C are ordinarily 
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used, but may vary in accordance with the suppliers instructions. After digestion the 
reaction is electroph resed directly on a polyacrylamide gel to isolate the desired fragment. 

Size separation of the cleaved fragments is performed using 8 percent 
polyacrylamide gel described by Goeddel, D. et al % (1980) Nucl ic Acids Res., 8:4057. 

5 "Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 

complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

10 "Ligation" refers to the process of forming phosphodiester bonds between two 

double stranded nucleic acid fragments (Maniatis, T., et al, aiEOU P- 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units to T4 DNA ligase Cligase") per 0.5 \yg of approximately equimolar amounts of the 
DNA fragments to be ligated. 

15 Example 1 

Isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above each of the DN As disclosed herein bv virtue of the fact that it 
includes an intact open reading frame is useful to a greater or lesser extent as a screen for 
identifying antimicrobial compounds. A useful approach for selecting the preferred DNA 

20 sequences for screen development is evaluation by insertion-duplication mutagenesis. This 
system disclosed by Morrison et al., J. Bacteriol . 159:870 (1984), is applied as follows. 

Briefly, random fragments of Streptococcus pneumoniae, strain 0100993 DNA are 
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication 
based shearing) followed by gel fractionation and end repair employing T4 DNA 

25 polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400 
base pairs, a size sufficient to ensure homologous recombination and to insure a 
representative library in ExolL The fragments are then inserted into appropriately tagged 
plasmids as described in Hensel & al., Science 269: 400-403(1995). Although a number of 
plasmids can be used for this purpose, a particularly useful plasmid is pJDC9 described by 

30 Pearce et al, Mol. Microbiol . 9: 1 037 ( 1 993) which carries the erm gene facilitating 
erythromycin selection in either E. coli or S. pneumoniae previously modified by 
incorporation of DNA sequence tags into one of the polylinker cloning sites. The tagged 
plasmids are introduced into the appropriate S. pneumoniae strain selected, inter alia, on the 
basis of serotype and virulence in a murine model of pneumococcal pneumonia. 
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It is appreciated that a seventeen amino acid competence factor exists (Havastein et 
al.. Eri2£. Mail Acjid, ScL USA 92: 1 1 140-44 (1995)) and may be usefully employed in this 
protocol to increase the transformation frequencies. A proportion of transformants are 
analysed to verify homologous integration and as a check on stability. Unwanted levels of 
5 reversion are minimized because the duplicated regions will be short (200-400 bp), however 
if significant reversion rates are encountered they may be modulated by maintaining 
antibiotic selection during the growth of the transformants in culture and/or during growth 
in the animal. 

The S. pneumoniae transformants are pooled for inoculation into mice, eg., Swiss 
10 and/or C57B1/6. Preliminary experiments are conducted to establish the optimum 
complexity of the pools and level of inoculum. A particularly useful model has been 
described by Veber et al. (J, Antimicrohiol. Chemother (]9Q 3) in which 10 5 cfu 

inocula sizes are introduced by mouth to the trachea. Strain differences are observed with 
respect to onset of disease e.g.,3-4 days for Swiss mice and 8-10 days for C57B1/6. 
15 Infection yields in the lungs approach 10* cfu/lung. IP administration is also possible when 
genes mediating blood stream infection are evaluated. Following optimization of 
parameters of the infection model, the mutant bank normally comprising several thousand 
strains is subjected to the virulence test. Mutants with attenuated virulence are identified by 
hybridization analysis using the labelled tags from the "input" and "recovered" pools as 
20 probes as described in Hensel sL al. Science 269: 400-403( 1 995). S. pneumoniae DNA is 
colony blotted or dot blotted. DNA flanking the integrated plasmid is cloned by plasmid 
rescue in E. coli (Morrison et al., J, Bacierjp.l- 159:870 (1984)) and sequenced. Following 
sequencing, the DNA is compared to the nucleotide sequences given herein and the 
appropriate ORF is identified and function confirmed for example by knock-out studies. 
Expression vectors providing the selected protein are prepared and the protein is configured 
in an appropriate screen for the identification of anti-microbial agents. Alternatively, 
genomic DNA libraries are probed with restriction fragments flanking the integrated 
plasmid to isolate full-length cloned virtilence genes whose function can be confirmed by 
"knock-out" studies or other methods, which are then expressed and incorporated into a 
30 screen as described above. 

The individual full length sequences given herein are summarized in the following 
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity" there is the 
deduced identity of each open reading frame of the invention determined using Blastp 
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each 
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ORF encodes more than one ORF. For example, SEQ ID NO:224 has an ORF# 1 but no 
#2; thus the polynucleotide encodes this ORF#l, but no other ORF was detected. On the 
other hand, SEQ ID NO:225 and 226 have ORF#s 1 and 2 respectively, indicating that th y 
were both encoded by the same polynucleotide. This can also be seen in Table 2 where the 
5 polynucleotide of SEQ ID NO: 2 encodes the ORFs of SEQ ID NOS: 225 and 226. Table 1 
also shows the start ("START" column) and stop ("STOP" column) codons for each ORF 
and their positions in the encoding polynucleotide sequence. The SEQ ID NOS of the 
polypeptides of this table are linked to both a deduced identity in this table and a 
polynucleotide sequence in Table 2 which encodes each polypeptide. The "Direction*' 
10 column in Table 1 shows the direction of the ORF encoding each poypetide in this table. 
"Forward" denotes the sense orientation and "Reverse" denotes the antisense orientation of 
the ORF. 
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TAPL5 i 







ICodon 




Position 






SEQ ID NO: 


Identity 


ORF#iStan 


Stop 


Start 


Stop 


Direction 


224. 


Phosphate Transport ATP-Binding Protein 
PSTB.- Escherichia 


1 


-CAT 


TCA- 


2 


553 


Reverse 




probable ATP binding protein - Bacillus 
subtilis 


1 


-CAT 


TCA- 


38 


781 


Reverse 


226. 


Nopaline Transport System Permease 
Protein Nocm. - Agrobacteriurn Tumefaciens 


2 


-CAG 


TTA- 


781 


1512 


Reverse 


ITT 

227. 


Aspartate Aminotransferase A (EC 2.6.1 1) 
(Transaminase A) (ASPAT.) - Rhizobium 
Mcliloti. 


1 


ATG 


TAG 


480 


671 


Forward 


228. 


ISL2 protein - Lactobacillus helveticus 1 1 


ATG 


TAA 


549 


947 


Forward 




ISL2 protein * Lactobacillus helveticus 12 


ATG 


TAG 


889 


1353 


Forward 


230. 


Unknown 13 


ATG 


TGA 


1421 


1570 


Forward 


231. 


Unknown |2 


-CAT 


CTA- 


111 


413 


Reverse 


232. 


Unknown 


1 


ATG 


TGA 


133 


282 


Forward 


233. 


Acyl Carrier Protein (ACP.) - Rhizobium 
Meliloti. 


1 


-CAT 


CTA- 


673 


906 


Reverse 


234. 


Unknown 


1 


-CAT 


TTA- 


15 


137 


Reverse 


235. 


Unknown 


*> 


-CAT 


CTA- 


68) 


989 


Reverse 


236. 


Sulfate Transport System Permease Protein 
CYST (Fragment). - Synechocystis SP. 
(Strain PCC 6803). 


2 


-CAT 


TTA- 


336 


689 


Reverse 


237. 


probable transposase (insertion sequence 
1S861) - Streptococcus agalacuae (str ain 
COH-M 


-» 


-CAT 


TC AG- 


149 


454 


Reverse 


238. 


Unknown 


2 


-CAG 


ITA- 


567 


851 


Reverse 


239. 


"PTS SYSTEM 


1 


ATG 


TA A 


AQ 


ATI 


Forward 


240. 


M PTS SYSTEM 


2 


-CAT 


TTA- 


151 


789 


Reverse 


241. 


nitrogen fixation protein (nifS) homolog - 
Haemophilus influenzae (strain Rd KW 20) 


I 


-CAT 


TCA- 


1 


207 


Reverse 


242, 


cellobiose phosphotransferase system celA - 
Bacillus stearothermophilus 


1 


-CAT 


TCA- 


84 


431 


Reverse 


243. 


surface protein PspA - Streptococcus 
pneumoniae 


1 


ATG 


TAA 


22 


321 


Forward 


244. 


Unknown 


2 


ATG 


TAA 


272 


469 


Forward 


245. 


ATP-dependent Clp proteinase (EC 3.4.21.-) 
chain cipL - Lactococcus lactis subs p. lactis 


1 


ATG 


TGA 


83 


760 


Forward 
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olasmid dUCL22 












246. 


Beta-Glue sidasc (EC3.2.1.21) 

fGenrinhiase) (TVllnhiacA^ \ fR^tn.rv. 

Glucoside GL Ucohydroiasc) (Amygdalase). 
- Bacillus Subtilis. 


1 


-CAT 


TCA- 


153 


599 


1 Reverse 


247. 


GWcine BetAine/I •Pmliim Tnntnnrt ATP 

Bindine Protein Prov. - Escherichia Col. I. 


•) 




I AA 


1 CO 


887 


Forward 


248. 


Unknown 


3 


-CAT 


TTA- 


1102 


\?U 


Reverse 




Jto Kioosomal Protein SI 1 (BSL 1). - 

DobJiiUd OUDW15. 


1 


ATG 


TGA 


15 


164 


Forward 


250. 


DNA - Directed RNA Polymerase Alpha 

Chain 7 7 1 f\\ /Tram********!!** a ul a 

v - nain i*-^, x././.o)(i ranscnptase Alpha 

f*hiiifil - Rnr*illnc CiiRtilie 
^iiaiii;. • DauiilUa OUDUIIS. 


2 


ATG 


TAA 


282 


1217 


Forward 


251. 


Peptide Chain Release Factor 3 fRF-3). - 
Bacteroides Nodosus (Dichelobacter Nod 
Osus). 


1 


ATG 


TAG 


212 


667 


Forward 


252. 


Unknown 


1 


ATG 


TAG 


267 


353 


Forward 


253. 


Riboflavin Synthase Alpha Chain (EC 
2.5.1.9). -Bacillus Subtilis. 


2 


-CAT 


CTA- 


213 


662 


Reverse 


254. 


Unknown 


3 


-CAT 


TTA- 


833 


1045 


Reverse 


255. 


Unknown j ] 


-CAT 


TTA- 


83 


424 


Reverse 


256. 


mesi protein - Leuconostoc mesenteroides 


2 


ATG 


TAG 


448 


1302 


Forward 


257. 


Transacetylase BMTD (EC 2.3.1.*). - 
Bacillus Subtilis. 


1 


CTG 


TAG 


3 


320 


Forward 
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SEC 

258. 



ID NO: 



259. 



260. 



261. 



262. 



Identity 

ribose-phosphate pyrophosphokinase (EC 
2.7.6.1.) - Bacillus caldolvticus 



Unknown 



Unknown 



nitrogenasc C (nifC) homolog - 
Haemophilus influenzae (strain Rd 



UnJcown 



ORF# 

1 



Codon 



Start 

CTG 



ATG 



ATG 



ATG 



ATG 



Stop 

TAG 



TAA 



TAG 



TAG 



TAA 



Start 

I 



66 



108 



631 



606 



Stop 
642 



Direction 

Forward 



614 Forward 



590 



855 



Forward 



263. 



Unknown 



ATG 



TGA 



495 

1466 

594 



Forward 
Forward 



264. 
265. 

266. 
267. 



Unknown 

Acetyl Esterase (EC 3. 1 .-.-). - Caldocellum 
ISaccharolvticum, 
Unknown 

Triosephosphaste Isomerase (EC 53.1.1) 
CUM) . • Bacillus Subtilis. 



ATG 
ATG 

-CAT 
-CAT 



TGA 
TAA 

TCA- 
TTA- 



639 
274 



270 



Forward 
Forward 



157 Reverse 
665 Reverse 



270. 
271. 



272. 
273. 



275. 



276. 
277. 



Branched-chain Amino Acid 
Aminotransferase (EC 2.6.1.42) 
(Transamin ase B). - ES Cherichia Coli 
branched-chain-amino-acid transaminase 
homolog - Haemophilus influenzae (strain 
Rd KW20) 

JDnaK protein - Lactococcus lactis 

Ketol- Acid Reductoisomerase (EC 1.1.1 .86) 
(Acetohydroxy-Acid Isomeroreductase). - 
Lactococcus Lactis (Subsp. Lactis) 
(Streptococcus Lactis) 



Unknown 

Amidophosphoribosyltransferase Precursor 
(EC 2.4.2.14) (Glutamine Phosphoribosy 
Lpyrophosphaste Arnidotransferase) ( Aiase). 
Bacillus Subtilis 

Pyrroltdone-Carboxylate Peptidase (EC 
3.4.19.3) (5-Oxoprolyl-Peptidase). - STR 
Eptococcus Pyogenes. 



ATG TAG 



ATG 



CTG 



TAA 



TGA 



110 



708 



50S Ribosomal Protein LI 6. - Mycoplasma 
Capricolum. 



serine O-acetyltransferase (EC 2.3.1.30) - 
Bacillus stearotherrnophilus 
Unknown 



ATG TAA 



99 



-CAT 
-CAT 



-CAT 



ATG 



736 



Forward 



842 Forward 



749 
428 



Forward 
Forward 



CTA- 
CTA- 



TCA- 



278 
152 



156 



TAA 



33 



-CAT 
ATG 



CTA- 
TAG 



631 
775 



803 



416 



577 
61 



Reverse 
Reverse 



Reverse 



Forward 



1194 
648 



Reverse 
Forward 
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278. lUnkn wn 


[ 


-CAT i 


ZTA- 


165 


335 I 


Reverse 


279. 1 
( 

1 


Lipoprotein Signal Peptidase (EC 3.4.23.36) 
Prolipoprotein Signal Peptidase) (Signal 
Peptidase II) (Spase II). - Staphylococcus 
Aureus. 


1 


ATG 


rAA 


56 


517 


Forward 


280. 


Unknown 


1 


ATG 


TAA 


214 


534 


Forward 


281. 


Alpha-Acetolaciate Decarboxylase (EC 
4.1. 1. 5). - Bacillus Subtilis. 


1 


-CAT 


CTA- 


104 


445 


Reverse 


282. 


Dihydrodipicolinate Synthase 9EC 4.2.1.52) 
(DHDPS). - Bacillus Subtilis. 


3 


-CAT 


TCA- 


675 


884 


Reverse 


283. 


Polyribonucleotide Nucleotidyltransferase 
(EC 2.7.7.8) (Polynucleotide Phosphor 
Ylase) (Pnpase). - Photorhabdus 
Luminescens. 


1 


-CAT 


TCA- 


1 


855 


Reverse 


284. 


Single-Strand Binding Protein (SSB) fHelix- 
Destabilizine Protein). - Bacillus S Ubttlis 


1 


-CAT 


TTA- 


128 


598 


Reverse 


285. 


ATP-Dependent CLP Protease ATO- 
Binding Subunit CLPX. - Escherichia Coli. 


2 


-CAT 


CTA- 


195 j482 


Reverse 


286. 


ATP- Dependent CLP Protease ATO- 
Bindine S 1 .unit CLPX. - Escherichia Coli 


4 


-CAG 


TCA- 


676 


990 


Reverse 


287. 


N-(5'-Phosphoribosyl) Anthranilate 
Isomerase (EC 5.3.1.24) (PRAI). - 
Lactococcus Lactis (Subsp. Lactis) 
(Streptococcus Lactis). 


1 


ATG 


TAA 


83 


712 


Forward 


288. 


30S Ribosomal Protein S2. - Escherichia 
Coli. 


1 


ATG 


TGA 


277 


591 


Forward 


289. 


30S Ribosomal Protein S2. - Escherichia 
Coli. 


3 


ATG 


TGA 


918 


10641 Forward 

i 


290. 


Possible beta-eaiactosidase precursor 


1 


-CAT 


CTA- 


120 


545 I Reverse 


291. 


Unknown 


1 


-CAT 


TTA- 


77 


283 1 Reverse 


292. 


Unknown 


1 


-CAT 


TTA- 


1 


198 Reverse 


293. 


*Phospho-2-Dehydro-3-Deoxyheptonate 
Aldolase 


1 


-CAT 


TTA- 


217 


690 


Reverse 
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I 

SEQ ID NO: [identity 



ORF# 



Codon 



Start 



Stop 



Position 



Start 



Stop 



Direction 



294. 



295. 
296. 



Unknown 



-CAT 



Unknown 



TTA- 



-CAT 



TCA- 



429 



599 Revere 



739 



936 



298. 

299. 
300.. 

301. 
301 



303. 
304. 



305. 
306. 
30,7. 
308. 



309. 
310. 



11. 



312. 
313. 



Sorbitol Dehydrogenase (EC 1.1.1.14) (L- 
Iditol 2-Dehydrogenase). - Bacillus Sub 
Tilis. 

EBG Operon Repressor. - Escherichia Coli 



ATG 



TAA 



celiobiose phosphotransferase system celA 
Bacillus stearothermophilus 
Unknown 

Adenylosuccinate Lyase (EC 43.2.2) 
(Adenylosuccinase) (ASL). - Bacillus Subtil 
Is 

ATP-Binding Protein BEXA. - Haemophilus 
Influenzae. 

L-Lactate Dehydrogenase (Cytochrome) (EC 
1 .1.2.3). - Escherichia Coli. 



-CAT CTA 



64 



668 



-CAT TTA- 



Phosphate Transpon System Permease 
Protein PSTC. • Escherichia Coli. 
Sulfate Transpon ATP-Binding Protein 
Cysa. - Synechococcus SP. (Strain PCC 794 
2) (Anacvstis Nidulans R2Y 



Unknown 
Unknown 
Unknown 

Phosphoribosylfomiylgiycinamidme Cyclo- 
Ligase (EC 6.3.3. 1 ) (AIRS) (Phosphoribo 
Syl*AminoimidazoIe Synthetase) (AIR 
Synthase). - Bacillus Suhtilis 
Unknown 

SMS Protein. - Escherichia Cnti 



-CAT 
-CAT 



ATG 



ATG 



ATG 



-CAT 



249 



TCA- 581 
TTA- 99 



TGA 



TAA 



TGA 



CTA- 



-CAT 
-CAT 
ATG 



sigma 42 protein - Enterococcus faecalk 



Unknown 
Guanyiate Kinase (EC 2.7.4.8) (GMP 
Kinase). - Escherichia Col i. 
Unknown 



TTA 

TCA- 

TAG 



HI 



337 



507 



110 



-CAT TCA 



ATG 

-CAT 

-CAT 




-CAT 
-CAG 



838 
282 
1108 
331 



TAA 313 
TCA- 102 



TCA- 



TTA- 
TTA- 



8 



76_ 
415 



480 



1060 



Forward 



Reverse 



566 

964 
809 

404 
507 



□07Q 
838 



Reverse 

Reverse 
Reverse 



Forward 
Forward 



1077 
749 



Forward 
Reverse 



Reverse 
Reverse 
1290 (Forward 
477 (Reverse 



765 
416 
487 



390 
849 



Forward 
Reverse 



Reverse 



Reverse 
Reverse 
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J 1 0. 


r 1 J OVSiCTIl 


L 


—PAT 


IL.A— 


loo 




Reverse 


319. 


L-Fucose is merase (EC 53 A.-) - 
Escherichia Coli. 


1 


-CAT 


CTA- 


9 


482 


Reverse 




Unknown 


J. 


-t-A J 


TTA- 


495 


650 


Reverse 


321. 


Unknown 


1 


ATG 


TGA 


130 


231 


Forward 


322. 


3-OxoacyHAcyi-Camer Protein] Reductase 
(EC 1.1.1.100) (3-Ketoacyl- Acyl Car Rier 
Protein Reductase). - Escherichia Coli. 


1 


-CAT 


TCA- 


51 


200 


Reverse 


323. 


3*Oxoacyl-[ Acyl-Camer Protein] Reductase 
(EC 1.1.1.100) (3-Ketoacyl- Acyl Car Rier 
Protein Reductase). - Escherichia Coli. 


2 


-CAT 


TCA- 


231 


614 


Reverse 


324. 


2-lsopropyimalate Synthase (EC 4.1.3.12) 
(Alpha-lsopropylmalate Synthase) ( Alph A- 
IPM Synthetase). - Lactococcus Lacus 
(Subsp. Lactis) (Streptococcus Lactis). 


1 


-CAT 


TTA- 


31 


231 


Reverse 


325. 


2-lsopropyimalate Synthase (EC 4.1.3.12) 
( Alpha-lsopropylmalate Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lacus 
(Subsp. Lactis) (Streptococcus Lactis). 


2 


-CAT 


TCA- 


191 


334 


Reverse 


326. 


Unknown 


3 


-CAT 


TCA- 


309 


452 


Reverse 




O.Tcnnrnnulmalaf* Qunthac* (T*C* A 1 "X 

x-i&uprupyiiTiaiaic oynuiasc (cv> h.i.j.izj 
( Alpha- Isopropylrnalaie Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


A 
*♦ 


— V» A 1 


1 I A— 




1 1 TT 


Reverse 


328. 


Unknown 


2 


-CAT 


TCA- 


170 


469 


Reverse 


329. 


Unknown 


1 


ATG 


TAG 


105 


473 


Forward 


330. 


Strscaa Ncbi gi: 310629NCBI gi: 473 - 
Streptococcus eordonii (strain PK488) DNA 


I 


-CAG 


CTA- 


81 


665 


Reverse 
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SEQ ID NO: 

331. 
332. 



Identity 

Unknown 

lysyl aminopcptidasc (EC 3.4.1 1.15) 
precursor - Lactococcus lactis 



ORF# 

1 



jCodon I 
Start Istop 



ATG 



-CAT 



TGA 



I Position 
Start 

228 



TCA- 



107 



Stop 

374 



766 



Direction 
Forward 



Reverse 



333. 



Indoie-3-Glycerol Phosphate Synthase (EC 
4.1.1.48) (IGPS). - Lactococcus Lactis 
(SUBSP. Lactis) (Streptococcus Lactis) 



-CAT 



TCA- 



127 



369 



Reverse 



334. 



Anthranilate Phosphoribosyitransferase (EC 
2.4.2. 18).-Lactococcus Lactis (Sub SP. 
Lactis) (Streptococcus Lactis). 



-CAA 



TCA- 



366 



641 



Reverse 



335. 



Tagatose*6-Phosphate Kinase (EC 2.7.1.-) 
(Phosphotagatokinase). - Lactococcus L 
Actis fSubsp. Lactis) (Streptococcus Lactis) 



ATG 



TGA 



42 



524 



Forward 



336. 



Unknown 



ATG 



TGA 



73 



337. 



Unknown 



-CAT TTA 



32 



338. 



Unknown 



I ATG 



TGA 



306 



572 



339. 



Acetolactate Synthase Large Subunit (EC 
4.1.3.18) (AHAS) (Acetphydroxy-Acid Syn 
thase Large Subunit) ( ALS). - Lactococcus 
Lactis (SUBSP. Lactis) (Streptococc us 
Lactis). 



ATG 



TAG 



59 



502 



Forward 



340. 



341. 
342. 



346. 
347. 
348. 
349. 



350. 
351. 



Penicillin-Binding Proteins 1A/1B. 
Bacillus Subtilis. 



ATG 



TGA 



535 



Unknown 



Amhraniiaie Phosphoribosyitransferase (EC 
2.4.2.18). - Lactococcus Lactis (SUB SP. 
Lactis) (Streptococcus Lactis). 



ATG 



TAG 



165 



■CAT 



TCA- 



Unknown 
Unknown 
Unknown 
Unknown 



Unknown 
NIFS Protein Homolog (Fragment). 
Lactobacillus Delbrueckii (SUBSP. 

Bulgancus). 

'Ornithine Carbamovltransferase 



ATG 
-CAT 
-CAT 
CAT 



-CAT 
ATG 



TAA 
CTA- 
CTA- 
CTA- 



TTA- 
TAG 



159 
315 
468 
482 



380 
87 



720 Forward 



488 'Forward 
394 Reverse 




503 
644 
587 
631 



Forward 
Reverse 
Reverse 
Reverse 



757 
797 



502 I Forward 
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353. 


PSEGLI NCBI gi: 499660 - Pseudomonase 
Diitida. 


1 


-CAT 


1*1 1 A 

Tl A- 


j\J 




Reverse 


354. 


Unknown 


1 


CTU 


TA A 

TAA 


i 
1 




Forward 


355. 


Thioredoxin. - Streptomvces Clavuligerus. 


I 


ATG 


TAG 


CO 

JO 


JO 


Forward 


356. 


D-aianine permease (dagA) homokog - 
Haemophilus influenzae ( strain Rd KW20) 


1 


ATG 


TAG 


185 


343 


Forward 


357. 


Unknown 


1 


r> A "T* 

-CAT 


TTA- 


78 


3/1 


Reverse 


358. 


"DNA Polvmerase III 


1 


-CAA 


TCA- 


O") 


301 


Reverse 


359. 


Transport ATP-Binding Protein Coma. - 
Streptococcus Pneumoniae. 


1 


-CAT 


TTA- 


112 


552 


Reverse 


360. 


Unknown 


2 


ATG 


TAA 


A A* 

445 


C Af\ 

540 


Forward 


361. 


surface protein PspA - Streptococcus 
pneumoniae 


1 


-CAT 


TCA- 


45 


41o 


Reverse 


362. 


Licid Protein. - Haemophilus influenzae 


2 


ATG 


TAA 


268 


633 


Forward 


363. 


Unknown 




ATG 


TAA 


527 


640 


Forward 


364. 


Giutamine Transport ATP- Binding Protein 
GLNO. - Escherichia Coli 


1 


CTU 




i 




Forward 


365. 


Unknown 


1 


ATG 


TAG 


184 


303 


Forward 


366. 


Unknown 


1 


ATG 


TGA 


794 


919 


Forward 


367. 


Lipoamide Dehydrogenase Component (E3) 
of Pyruvate Dehydrogenase Complex (EC I. 
8.1.4) (Dihydrolipoamide Dehydrogenase). - 
Azotobacter Vinelandii. 


1 


ATG 


TAA 


3 


416 


Forward 
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ICodon 




Position 






SEO ID NO: 


Identity 


ORF#lstut 


Stop 


Start 


Stop 


Direction 


368. 


Orotate Phosphoribosyitransferase (EC 


1 


l-CAT 


CTA- 


21 


311 


Reverse 




2.4.2.10) (OPRT). - Bacillus Subtilis. 












369. 


Unknown 


1 
1 


ATG 


TAA 


28 


309 


Forward 


370. 


SPOOB-Associated GTP-Binding Ptroein. - 
Bacillus Subtilis. 


1 
1 


ATG 


TGA 


110 


538 


rurwfiiu 


371. 


Transport ATP-Binding Protein COMA. - 
Streptococcus Pneumoniae. 


1 


-CAA 


TCA- 


47 


679 


rvc verse 


372. 


prephenate dehydrogenase (EC 1.3.1.12) - 
lactococcus lactis. 




ATG 


TAG 


271 


492 


I vM WOJ U 


373. 


O-Syaloglycoprotein Endopeptidase (EC 
3.4.24.57) (Glycoprotease). - Pasteurella 
Haemolvrica. 


1 
1 




CTG 


TAA 


1 


450 


rurwsi u 


374. 


Unknown 


1 


-CAT 


TTA- 


7 


?8? 




375. 


Unknown 


1 


TTG 


TAA 


2 


?6S 




376. 


"DNA Polymerase in 


1 


ATG 


TGA 


3 


401 


Forward 


377. 


Unknown 


2 


ATG 


TAA 


361 


516 


Forward 


378. 


Dihydrodipicolinate Synthase (EC 4.2.1.5V 
(DHDPS). - Bacillus Subtilis. 


1 
1 


—PAT 


TTA- 


179 


457 


Reverse 


379. 


Unknown 


2 


-CAT 


TCA- 


104 


32? 


Reverse 


380. 


Unknown 


2 


-CAT 


TCA- 


248 


649 


Reverse 


381. 


Unknown 


1 


-CAT 


TTA- 


56 


*>4 


Reverse 


382. 


Unknown 


2 


-PAT 


TTA- 


254 


409 


Reverse 


383. 


Unknown 


1 


ATG 


TAG 


138 


617 


Forward 


384. 


Unknown 


2 


-CAT 


TTA- 


225 


470 




38S. 


Unknown 


2 


ATG 


TAA 


747 




386. {Unknown 


1 


ATG 


TAG 


294 


441 


Forward 


387. 


Unknown 


2 


ATG 


TGA 


356 


544 


Forward 


388. 


Unknown 


1 


-CAT 


CTA- 


4 


14) 


Reverse 


389. 


Unknown 


1 


CTG 


TAG 


1 


579 


Forward 


390. 


Unknown 


2 


-CAT 


TTA- 


309 


452 


Reverse 


391. 


Phosphopentomutase (EC 5.4.2.7). - 
Escherichia Coli. 


1 


ATG 


TAA 


3 


233 


Forward 


392. 


D-Alanyi-D-Alanine Carboxypeptidase 
Precursor (EC 3.4.16.4) (DD-Peptidase)(DD 
- CarboxypeptidaseXCpase)(PBP5). - 
Bacillus Subtilis. 


1 


-CAT 


TTA- 


52 


537 


Reverse 


393^ 


Na+ and Cl-dependem gamma-aminobutryic 
acid transDorter homoloe-HaemoDhilus 


1 


TTG 


TGA 


2 


268 


Forward 
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influenzae (strain Rd KW20) 














in a 


Unknown 




ATfi 


1 On 


no 




Forward 


lor 


ainyaroiipOaiTuoe acnyurogenasc ice 

1 .0. 1 ~ rCJUOaClCr CArDln01ICU5 


1 


ATT. 

nlu 


Tf!A 

1 OA 






Forward 


370. 




L 




TfiA 

lUn 


**r 1 




Forward 


397. 


Beta-Glucasidase A (EC 

j.x. i .x i ^oenuooiaseA^euooiase A*>eia-LJ- 

kJiUwvaiuc uiucunyuruioscj. Vvjosuiuiuni 

Thermocellum. 


1 


ATC 


TAA 


184 


453 


Forward 


398. 


Uracil Permease. - Bacillus CaldoWticus. 


1 


ATG 


TAA 


93 


353 


Forward 


100 


Unknown 


I 


ATT: 


TAp. 

IAu 




510 


Forward 


400. 


ligoendopeptidase F- Lactococcus lactis 


\ 


ATG 


TGA 


134 


310 


Forward 


401 


Streptococcus Faecalis DNA. 


9 




TAA 
X /v/\ 


109 




Forward 


402. 


Unknown 


2 


ATG 


TAA 


376 


507 


Forward 


403. 


Exodeoxyribonuclease Small Subunit (EC 
3. 1.11. 6) (Exonuckease VII Small Subunit). - 
Escherichia Coli. 


2 


-CAT 


TCA- 


470 


682 


Reverse 


404. 


ATP-Depenem DNA Heltcase RECC (EC 
3.6. 1 .-). - Escherichia Coli. 


\ 


ATG 


TGA 


3 


455 


Forward 

i 
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Codon 




Position 






SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


405. 


Possible thiamin biosvnthetic enzvme 


1 


-CAT 


TTA- 


15 


347 


Reverse 


406. 


SPOU Protein. - Escherichia Coli. 


2 


ATG 


TAA 


322 


618 


Forward 


407. 


Maionyl Coa-Acyl Carrier Protein 
Transacyiase (EC 2.3.1.39). - Escherichia 
Colt. 


1 


ATG 


TAG 


85 


498 


Forward 


408. 


Unknown 


1 


-CAT 


CTA- 


17 


118 


Reverse 


409. 


nucleoside diphosphate kinase (ndk) 


1 


CTG 


TGA 


1 


159 


Forward 




homolog- Haemophilus influenzae (strain 














RdKW20) 














410. 


Nucleoside Diphosphate Kinase (EC 
2.7.4.6)(NDK) (NDP Kinase). - Escherichia 
Co LI. 


2 


ATG 


TAG 


215 


481 


Forward 


411. 


Unknown 




-CAT 


TTA- 


21 


368 


Reverse 


412. 


Unknown 




-CAT 


TCA- 


162 


314 


Reverse 


413. 


Unknown 




ATG 


TAA 


187 


417 


Forward 


414. 


Unknown 




ATG 


TGA 


316 


417 


Forward 


415. 


Unknown 




ATG 


TGA 


316 


453 


Forward 


416. 


Enolase (EC 4.2. 1 . 1 1 K2*Phosphoglycerate 
Dehydratase)(2-Phospho-D0Glycerate 
Hydro-Lvase). - Bacillus Subtilis. 




-CAT 


TTA- 


4 


435 


Reverse 


417. 


Unknown 




CTG 


TGA 


1 


1 




418. 


Unknown 




ATG 


TGA 


39 


383 Forward 


419. 


PILB Protein. - Neisseria Gonorrhoeae. 




-CAT 


TTA- 


145 


327 1 Reverse 


420. 


Unknown 




ATG 


TGA 


285 




421. 


Unknown 




CTG 


TAG 


2 


379 1 


Forward 


422. 


integrase/recombinase (xprB) homolog - 
Haemophilus influenzae (strain Rd KW20) 




ATG 


TAA 


305 


421 | 


Forward 


423. 


Unknown 




-CAT 


CTA- 


173 


436 Reverse 


424. 


Unknown 






CTA- 


182 


427 [Reverse 


425. 
426. 
427. 


Unknown 
Unknown 




ATG 

-CAT 

ATG 


TAA 

CTA- 

TGA 


49 
21 
177 


372 Forward 
446 Reverse 
380 Forward 


428. 
429. 
430. 

431. 


Possible phosphatase 
Unknown 

Phosphoenolpymvate Carboxylase (EC 
4.1.1.31). - Corvnebacterium Glutamicum. 
Unknown 




ATG 
ATG 
ATG 


TAG 
TGA 
TGA 


112 

no 

337 


402 Forward 
271 Forward 
561 Forward 


432. 


SOS Ribosomal Protein L31. - Bacillus ) 




ATG 1 
ATG r 


rcA 

rAA | 


214 | 
U2 | 


324 Forward 
426 1 Forward 
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SUDQilS. 














433. 


glycosyl trans I erase ugiis) nomoiog - 

I la*— miikiltic infll fcffftin Rrf 1CAV20^ 


i 
i 


ATG 


TGA 


127 


399 


Forward 




Unknown 


2 


-CAT 


CTA- 


244 


435 


Reverse 


435. 


Unknown 


2 


ATG 


TGA 


118 


309 


Forward 


436. 


Factor essential for Expression of 
MethicilUn Resistance. - Staphylococcus 
Aur Eus. 


2 


-CAT 


TTA- 


168 


434 


Reverse 


437. 


Unknown 


1 


-CAT 


TCA- 


2 


214 


Reverse 


438. 


Unknown 


1 


TTG 


TAA 


2 


142 


Forward 


439. 


Unknown 


1 


ATG 


TAA 


30 


191 


Forward 
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Codon 


IPositi n 






SEQ ID NO: 


Identity 


ORF# 


Stan 


Stop 


Start 


Stop 


Direction 


440. 


Aspartate Aminotransferase (EC 2.6. 1.1) 
(Transaminase AXASPAT). - Bacillus SP. 
(STRAIN YM-2V 




-CAT 


TCA- 


63 


416 




441. 


Unknown 




ATG 


TAA 


52 


342 




442. 


Unknown 




-CAT 


TCA- 


210 


455 




443. 


Unknown 




TTG 


TGA 


2 


517 




444. 


ATP-Dependent DNA Helicase RECG 
(EC 3.6. 1 .-) - Escherichia Coli. 




ATG 


TGA 


83 




rorwaio 


445. 


Unknown 




ATG 


TAA 


70 


184 


Forward 


446. 


Cell Division Protein FTSA. - Bacillus 
Subtilis. 




TTG 


TAA 


3 


371 


Forward 


447. 


Unknown 




ATG 


TAG 


70 


441 


Forward 


448. 


Unknown 




ATG 


TAG 


104 


454 


Forward 


449. 


Unknown 




CTG 


TAA 


1 


159 


Forward 


450. 


Unknown 




ATG 


inn 


1 OA 


347 


Forward 


451. 


Un' nown 




ATG 


TflA 


1 1 
J] 


421 


Forward 


452. 


L..*nown 




ATG 


luA 




416 


Forward 


453. 


Unknown 




ATG 


TGA 


290 


418 




454. 


Unknown 




ATG 




3 


269 


Forward 


455. 


JAG Protein (SPOIIIJ Associated Protein). - 
Bacillus Subtilis. 




ATG 


TGA 


93 


365 


Forward 


456. 


"DNA -3-Methyladenine Glycosidase I (EC 

3.2.2.20)(3-Methyladenine-DNA Glvcosylas 
EI 




ATG 


TAG 


91 


282 


Forward 


457. 


"Glucan 1 




-CAT 


TTA- 


4 




458. 


Unknown 




-CAT 


CTA- 


245 


400 i 


Reverse 


459. 


Glutamate/Aspanate Transport ATP- 
Binding Protein GLTL. - Escherichia Coli. 




-CAT 


TCA- 


81 


218 


Reverse 


460 


Unknown 




-CAT 


TTA- 


]03 


492 


Reverse 


461. 


Unknown 






TGA 


305 


484 


Forward 


462. 
463. 
464. 


Unknown 
Unknown 
Unknown 


2 
2 


-CAT 
-CAT 
-CAT 


TCA- 
TTA- 
TCA- 


29 

572 

652 


355 
818 
1026 


Reverse 
Reverse 
Reverse 


465. 
466. 
467. 


Unknown 
Unknown 

Llcpyrda NCB1 gi: 5 1 1014 - Lactococcus 
lactis. 


2 
2 
1 


-CAT 

ATG 

ATG 


TTA- 

PGA 

TGA 


318 
719 
134 


764 
805 
472 


Reverse 
Forward 
Forward 
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UWuKJWn 


1 
i 




TGA 1385 




rorwaiu 


469. 


Unknown 


2 


ATG 


TAA 


587 


721 


Forward 


470. 


salE protein - Neisseria meningitidis 


1 


-CAT 


TCA- 


23 


460 


Reverse 


471. 


Unknown 


2 


-CAG 


TTA- 


717 


1319 


Reverse 


472. 


Naphthoaie Synthase (EC 4. t .3.36) 
(Dihydroxynaphthoic Acid 
Synthetase )(Dhna Sy Nthetase). - 
Escherichia Coli. 


1 


ATG 


TGA 


97 


414 


Forward 
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SEOIDNO: 

473. 
474. 
475. 
476. 
477. 


1 

Identity 

Unknown 

Unknown 

Unknown 

Unknown 

Multiple Sugar-Binding Transport ATP- 
Binding Protein MSMK. - Streptococcus 
MUTA NS. 


ORF# 

I 

1 
1 
1 

2 


Cod n 
Start 

CTG 

-CAT 

-CAT 

-CAT 

ATG 


Stop 

TGA 

TCA- 

TTA- 

TTA- 

TAA 


Positi n 
Start 

1 

341 
217 
499 
407 


246 
748 
858 
729 
571 


Forward 
Reverse 
Reverse 
Reverse 
Forward 
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Table 2 shows the corr lation between the SEQ ID NO f each DNA sequence of 
the invention with the SEQ ID NO(S) of polypeptide or polypeptides that its open reading 
frame(s) encodes. For example, the DNA of SEQ ID NO: 1 encodes one polypeptide, that 
of SEQ ID NO;224. Whereas, the DNA of SEQ ID NO:2 encodes two polypeptides, the 
polypeptides of SEQ ID NO:225 and SEQ ID NO:226. 



TABLE 2 



DNA 


Protein (open reading frame) 






1 


224 




2 


tic *>*m 
225,22o 




3 


227 




4 


t>c no *>in 
22o,22y,Z3U 




c 

5 


23 I 




a 
0 


232 




7 


233 




o 

8 


234 






233 




1U 


230 




1 1 


23 / 




11 


23o 




IJ 






14 


240 




15 


241 




16 


242 




17 


243.244 




18 


245 




19 


246 




20 


247,248 




21 


249.250 




22 


251 




23 


252 




24 


253,254 




25 


255,256 




26 


257 




27 


258 




28 


259 




29 


260,261 




30 


262 
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31 


263 




32 


264 




33 


265 




34 


266 




35 


267 




36 


268.269 




37 


270 




38 


271 




39 


272 




40 


273 




41 


274 




42 


275 




43 


276 




44 


277 


| 


45 


278 


J 


46 


279 


[ 


47 


280 


j 


48 


281 


— ( 


49 


282 




50 


283 


[ 


51 


284 


~J 


52 


285.286 


"\ 


53 


287 


—J 


54 


288,289 


[ 


55 


290 


1 


56 


291 


A 


57 


292 




58 


293 


J 


59 


294,295 


— 1 


1 60 


296,297 


J 


61 
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1 
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1 


63 


301 
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64 


302,303 
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65 


304,305 
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67 


307 
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68 
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70 
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71 
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72 
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73 
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74 


315,316.317 




75 
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76 


319.320 




77 
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78 
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79 
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80 
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81 
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82 
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83 


331 




84 
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85 


333.334 




86 
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87 
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88 
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89 
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90 
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91 
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92 
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93 
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94 
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95 
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96 
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97 
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98 
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99 
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100 
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358 
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359 




110 


360 
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361 




112 
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113 
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114 


364 
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115 365 



116 366 



117 367 



118 368 



119 369 



120 
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122 372 
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125 375 
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129 
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130 380 
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398 
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136 
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144 


394 ~ ■ -J 
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401 — 
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402 ~ 
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403 • 
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156 406 " 
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157 


407 


15R 

1 JO 


408 


ISO 


400 410 
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HJ 1 


1 Al 
JO J 
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H1Z 


1A7 
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1 AA 
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l / 1 
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1 /Z 
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I ID 
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HZH 
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HZJ 


nc 
17D 
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1 /o 


HZ/ 


III 


+Zo 


17fl 

J /o 


470 
HZV 


1 7Q 


4 in 

HjU 


i fin 


41 1 
HJ 1 




417 


1 C7 
loZ 


411 
HDD 


1 ill 

loo 


414 


I OH 


415 




41A 


1 flA 
ISO 


417 
Hj / 


to / 


41 R 
HJo 


1 Oft 
loo 


41Q 


1 AO 


AAf\ 


ion 


441 

HHI 


101 


447 
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445 


195 
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196 
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197 


448 


198 


449 
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213 465 



214 466 
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468,469 



217 470.471 



472 
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450 
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451 




201 
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208 
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460 




210 


461 




211 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Black, Michael 
Hodgson, John 
Knowles , David 
Nicholas, Richard 
Stodola, Robert 

(ii) TITLE OF THE INVENTION: Novel Compounds 

<iii) NUMBER OF SEQUENCES: 477 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

<vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
<B) FILING DATE: 01-APR-1997 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/014690 

(B) FILING DATE: 02-APR-1996 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/025788 

(B) FILING DATE: 22-AUG-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Gimmi , Edward R 

(B) REGISTRATION NUMBER: 38,891 
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(C) REFERENCE / DOCKET NUMBER: P50466 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


1: 






CTCAAACATG 


GTCTCTTCTA GTTGCATGGT 


CGCAATCGGA 


TCCAAGGCTG 


AGGCTGGGCT 


60 


CATCCATTAA 


GAGGATATCT GGCTTAACAG 


AGATGGCACG 


AGCGA n \CAG 


AGACGTTGTT 


120 


GCTGACCACC 


TGATAAGGTC AAGGCTGACT 


TGTGGAGATC 


GTCTT* .AACC 


TGATCCCAAA 


180 


GGGCAGCCTG 


ACTAAGGGAG GTTTCTACGA 


TTTCATCTAG 


GACTTGCTTA 


TCCTTAACTC 


240 


CAGCACGTTC 


ATGCGCAAAG GTAATATTAC 


GGTAAATTGA 


CTTAGCAAAT 


GGATTGGGGC 


300 


GTTGAAAAAC 


CATTCCAATG TGTTTACGCA 


TTTCATAAAC 


GTTGATTTCT 


GGACGGTTGA 


360 


CATCAATTCC 


ACGATAGAGA ATCTGCCCAG 


TTACTTTAGC 


AATATCAATG 


GTATCATTCA 


420 


TGCGATTGAG 


ACTGCGTAAG TAGGTAGATT 


TCCCCGATCC 


CGACGGACCA 


ATCAAAGCTG 


480 


TAATTTTATT 


TCCTTTCAAA TTGCATATCA ATCCCCTTAA 


TGGATTCATT 


TTTACCATAG 


540 


TAAACATGGA 


CATCCTTAGT AGAAAGGGCT 


ACTTTTTCTT 


CAGGAAAGGT 


AAGGATATGC 


600 


TTCTCATCCC 


AGTTATATGT TGACATGGCT 


TCTCCTTTAG 


GCAGCGGTTA 


ATTTCTTGTG 


660 


TAGATAGCTT 


CCGAACTTAC GAG 






683 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
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CTAGTCCAAC 


TAACTGAGTT 


TTCCTTTATC 


TATTATATCA 


AATATAAGTC 


CGTTTGTAAC 


D U 


TAGTGAAGAA 


TTCTTTTGTC 


CGCTCTTCTT 


TAGGGGTGTG 


k*AI AAiv- i LA 






CAGACTCGAT 


GATTTTCCCC 


TTATCTAAGA 


AGAGAATTTT 


ATCCGCAACT 


TGGGCTACAA 


1 Oft 

J.OU 


AGGGCATGTC 


ATGACTGACC 


AAAATCATGG 


TCTGACCTGA 


CTTAGCAGCA 


TCTGCAATAG 


*> A A 

^40 


ACTTTTCTAC 


TTCACCGACC 


AATTCTGGGT 


C AAGGGC TG A 


AGTTGGTTCG 


TCTAAGAGCA 


JUU 


AAACATCTGG 


TTTCATAGCA 


AGCGCACGCG 


CTAGGGCAAC 


CCGTTGCTTC 


TGTCCACCTG 


~i r r\ 


ATAAATGGCG 


AGGATAATGG 


TTTTCACGGT 


CCGAAAGCCC 


AAC CTT AGC C 


AACTCTTCCT 


vi "i a 
4^0 


TGGCAATCTT 


AGTCGCTTCT 


TGGTCAGATA 


ATTTCTTGAC 


x x ^* x x r*/^ x x ^ 

AACAACCAAG 


CCTTCTTTCA 


480 


CATTATCAAG 


TGCTGTTCGG 


CGTTCAAATA 


AATTAAACTG 


mmpo x x x x r*r* 
TTGGAAAACC 


ATAGACAACT 


a a 
D4U 


TACGGCGTAG 


GGCAAGGATT 


TCTTCTTGAG 


TGATTTTAGA 


x x x x n^/"* x x r^tj* 
AAAATCAACT 


XXX X XP^JiT 

vi AAAAAL U AT 


£ftft 


CAATCTGAAT 


AGAGCCACTG 


TCAGGTGTTT 


CAAGATAATT 


^> x ^ x ^mn^*^ x 

GAGACTGCGA 


AGAAAGGTTG 


CCA 


ATTTTCCAGC 


TCCTGAAGAA 


CCAATCAAGG 


CTACAACTTC 


CCCTTTTTGA 


ATATCCAAGT 


"7 ^ A 


TCAGATGATC 


CAAGACAGTC 


TGTCCTGAAA 


AGGATTTGCT 


TAAATTCGAA 


ATCTTAATCA 




TTAACGAAGG 


TCTCCTTTCA 


CATCTGTTTG 


CACTGTATCA 


GGTGCAGAAA 


TAGC CATTTT 


S4U 


TCTCTCGATG 


AAACGACCGA 


GGCTTTCAAT 


TCCGATATTG 


ACTACCCAAT 


AAACAAGGGC 


AAA 

y uu 


AACAGAGATG 


AAGCGTTCAA 


AATAGCGATA 


ATCAGCTCCA 


CCTAGAATCT 


GAGCTTGGGC 


OCA 

you 


AAAGACTTCC 


ACAACACCCG 


CACTAAAAGC 


TAGAGATGTT 


CCCTTGGTCA 


AACCGATGAG 


1 A "» A 

1020 


GGAATTAATC 


AAGGTTGGAG 


TAGCTACCAC 


CGCTGCATTA 


GGAATAATCA 


C TCGTCGATA 


1080 


AACTTGCGCT 


CGGGTCATAC 


CCAGACTGCG 


CGCCGCCTCA 


ATCTCACCAG 


GATTAACTGA 


1X40 


GAGAATGGCT 


GCACGAATGG 


TTTCACTAGC 


ATAAGCTGCC 


TCATTAAAGG 


^X XXX ^ f^/^ X ^ 

CAAAAGCGAC 




AATCGCAAAA 


GCTGCAGCTG 


GAATCGCATT 


GATATTGAGA 


CCAGTTCCCC 


ATTGCTGATT 


1260 


GAGGGCTTTC 


AAAGCCAAAG 


GGATTCCGTA 


GTAGGTCAAC 


ATGAGTTGCA 


CCAAAATCGG 


1320 


TGTCCCTTTT 


AAGAAACTAA 


CAAAGAAGGC 


CTGCAAGGGA 


TATAAAATCT 


TGACACGATT 


1380 


GATCTTCACA 


ATGGCAAAAA 


GAAGCGCCAA 


AACCAAGCCA 


AAAAGGGCAC 


CGCCAATTGT 


1440 


CAACATAATT 


GTTGTTGGAA 


GTTGTTGGAC 


AATTCTAGGG 


ATTCCATCAA 


AGACCGAACG 


1500 


TAGGCTAAAC 


AG 










1512 



(2) INFORMATION FOR SEQ ID NO: 3: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA TNTTTGCGGA TGAAATCTAT 60 

GACCGCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 120 

TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCCGTGTG 180 

GGATGGATGG TCTTGTCTGG CCCTAAGACT CATGTTAAGG GCTATATCGA AGGGCTCAAT 240 
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ATGCTGTCCA 
ATCACTGTCG 
CTGGTGGACG 
GTTTGTCTGC 
TGTACCGTAT 
TCTTGGTTCA 
TTCCTCGTGT 
ATCGTAGATA 
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ATATGCGCCT TTGCTCTAAC GTTTTGGCTA CCAACGCTGC ATTAGGAATA 300 

TACAAACTTG CGCTTGGGGG GTCACCAATC AGTCGATGAA TTGCTTCTTC 360 

AATCTACGAG CAAAGAAATT TCATCTATAA TGCCATTCAA GATATTCCAG 420 

CGTTAAACCC AAGGCGGGGC TCTATATCTT CCCAAAAATC GACCGCAATA 480 

CGATGATGAT GAGCAGTTTG TCCTTGATTT CTTGAAGCAG GAAAAGGTTC 540 

TGGTCGAGGC TTTAACTGGC AGGAACCAGA CCACTTCCGT ATCGTTTACC 600 

TGATGAGTTA GCCCAAATCC AAGAAAAGAT GACTCGTTTC TTGAAACAGT 660 

GGGCTTGCAT TCGAAAAAGC TGGAAACATT TGCCTAGAG 70 9 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 



CTGCTTCAGA 
TAATACCTCT 
AGCTTTCGAG 
TCTTTCTATT 
TTAGTCATTT 
TACTCTATAA 
ATAACCGTAT 
TCTTCCGTGA 
GCCTCAGCTT 
GTAGTTCTAT 
TTGGTGTTCA 
TTAAACACGC 
CTCTTCAATA 
ACGAAAGCAA 
TTACGATTTC 
TACAAATCAA 
GCTATGAAGG 
GTGAACTATA 
GGAAAAATCT 
ACTCCACGTA 
GCGCTATCCA 
ATGTTTTCAA 
GCTGGCATTA 



ACCTGCCAAA 
TTGGGAGTGT 
TTTCTTGGCT 
TTTTCTNCTA 
TCTTGATGTA 
AATCTTTTCC 
TATCTAAAAC 
TATTTTATTT 
CAATATTTCT 
GAATGATGAA 
GCGCACGACT 
AAAAGGTGGA 
TGTGCGAGAA 
CTTAATCCGT 
AAGAACTCCT 
TCGCCCTAAA 
CTCAAGCGAT 
GTCATGATAT 
TGGCTGATAG 
AATCCAGCAA 
AGGAGAGAAG 
CAACCTATCG 
TCAATTATGA 



CCAGCACCGA 
GAAGTTAAGA 
CAGGCTGAAA 
CTAGTATAAC 
AAGAGATAGT 
ACATAACGGA 
AGCATAACAT 
TACCTGAAAA 
TACTTTCCAA 
GCAAGTAAAC 
TTTGAAGAGA 
CGAAAACCTA 
TATCGAACTT 
CGGAGCCAAT 
CTCAGTTCTG 
AAAAGAATTA 
TGTCACAAGT 
GAAGTTGTTC 
TGGTTATCAA 
ACTCAAGCCG 
CAAGGTTGAG 
AAATCATCGT 
ACTAGGATTC 



TAAC iTTGAT 
TTC-.6ATTGA 
AAGTCCACAG 
AAAAAAAGGG 
GAGTATTCCA 
TCGATAGGGA 
TCAACATAGT 
AATAGCACAC 
TAGACTCCCT 
AACTAACTGA 
TATTAGCTGT 
AATTAAGCCT 
ATGAACAAAT 
GGGTTGAAGT 
AGGACACGGT 
GCGAATCATT 
CAAGGGAGAA 
AAAATGAGTT 
GGGCCCATGA 
CTAATAGCTG 
AACATCTTTG 
AAACGCTTCG 
TAGTTTTGCA 



ATAAGATTGA 
AAAAGCCAAT 
GGCTTTTNCA 
AAGAAGGNAA 
GTTAAGAATC 
CTGTTATTCT 
GATCTCCTTT 
GCTCACAATT 
GCGAAACAAA 
TGCACGATTT 
ATTAAAAACA 
AGAAGACCTT 
TGCGGCTGAT 
AACTCTTGTT 
AATGATTGAT 
CTGGTAAAAA 
TTGTTTCTTT 
GCAGAAATAT 
AGATATATCC 
AAGATAAAGC 
CCAAAGTAAA 
GATTACGAAT 
GGAAGTCTAT 



GACACGACAC 
CAGACTTACA 
CTCCCACAAA 
ACTTCCCTGT 
AATTATATGC 
ATCTCTTGCT 
AAACTGTGAA 
CTTCCTAATA 
ATATGGTATA 
AAGCGTCTTG 
GCTTATCAAC 
CTTATGGCCA 
TTTGGTATCC 
CAAAGTGGTG 
GCGACGGAAG 
GAAATTTCAC 
GGATATCGCT 
CGGACAAGCT 
TCAAGCACAA 
TTATAACCAT 
AACGTTTAAA 
GAATTTGATT 
TATTTTCCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1°20 
1080 
1140 
1200 
1260 
1320 
1380 
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ATTGTCTGTA AGTCTACTGA CCTTGTTGTT TATCCCAGTC ATGGTTTCTA GTTCGGGCTC 1440 

AGAGTTTCAA AGTGGATGGC AAGAGCATCA ATTGATTGCT GAGAAGGTTA GTAAAACACT 1500 

TGACAAGACA TTTGATAAGG ATGTCAGAAA AATTCCGACC AGTCCAGTTT TATCAAAAAT 1560 

TTGTAGATGA GATGGGAAGG ATTTACTC 1588 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 690 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CTGTTTTTAA 


TACAGCTAAC 


ATCTCTTCAA 


AAGTGGTACG 


CTGAACACCA 


ACAAGACGCT 


60 


TAAATCGTGC 


ATCAGTTAGT 


TGTTTACTTG 


CTTCATCATT 


CATAGAACTA 


CTATACCATA 


120 


TTTTGTTTCG 


CAGGGAGTCT 


AATATTGTCA 


AATACTGGAG 


CGCTCATTGC 


TGGTATACGG 


180 


AATAAGATTG 


GCCCAGCTTC 


GATAACTGGG 


ATACCTGGTT 


CAAAACCAAG 


ATCTGTTGCA 


240 


GCGATTGGTG 


TAAAGATATC 


GTAACCTTTC 


ATAAGGTCTT 


CGTTTACATC 


TTTCACCATG 


300 


ACTGCATCAC 


AGTGAACATC 


ATAACCACGG 


TTTGAAAGTT 


CTTCTTCTAG 


AGCACTTTTA 


360 


ATTTGGTGAC 


TTGAGTTAAC 


ACCTGCACCG 


CAGGCAGCAA 


GAATTTTAAT 


CATTTAGATT 


420 


TCCTCCGATT 


TTATTTTTTA 


ATAGACAAGA 


TTAAGCGGTT 


GCTTCAGCAA 


TGTAAGCATA 


480 


AAGTTTTTCT 


GGTTCGGAAA 


TTTTTGATAG 


GTCTTCAAGA 


TGTCCATTTC 


CTGTGAAAAA 


540 


GTCCATCAAC 


TGAGCCAGAA 


TATTTGTTTG 


ACTTGAACTT 


GAGTTATTGA 


TGATAAAGAA 


600 


GAGCAAGGAT 


ACTTCTACTT 


CCTTATCAGG 


AGCTATCATA 


TTGTGAAAAG 


TTACTGATTT 


660 


TTCTAATCGA 


ACAACCACCA 


CTTTTCTCAG 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACTGCTACCT TTAAGAAGAT AGTAGACGTA TATACTTTTT TAAGAAAATC AAAAAGATAC 60 
TATAAAAAAT CTATTTGTTT ATTGAATTTA AGACTTTGGT AACAAATTGA AAATAAAAAG 120 
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GAGGTATTCA TCATGAATAC AAAAATGATG TCACAATTTT CTGTTATGGA TAATGAAATG 180 
CTTGCTTGCG TTGAAGGTGG AGATATTGAT TGGGGAAGAG AAATTAGTTG TGCAGCAGGG 240 
GTTGCATATG GCGCAATTGA TGGGTGTGCA ACAACGGTTT GATATTTCTA TTGGGACCAT 300 
TTGCTATAGG AATAGGTGTA ACTGGTGCTG CAGGTGGAG 



(2) INFORMATION FOR SEQ ID NO: 7: 



339 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CTGAAGACCT 
TCCGCCACCT 
TTTTGTATTC 
ACAAGTTTAG 
AACGTCTTGA 
TAATCACTTT 
ATACCAGCAA 
GAAAACATTT 
GATAGCGCAT 
CGTGGAAGTT 
GATTTTACCA 
GAACTCCGAT 
GTCAATTTCC 
CATCAAATCA 
AAAGTCCTCT 
TCTCATCTCT 
AGCATGGTAC 
GTCTTGACAA 
AGCTGTTTTT 
AGACCACCAC 
ATGGATTTGA 
TTCATCAAAT 
AATTCATAAG 
AAACCAACAC 
GCGTATTGAT 
AAACCTTTCC 
ATACGACCCA 



CGTGCTACTC 
TCAACTCAAG 
ATAACGAATA 
TATAACAGTA 
ATTAGCTTTT 
TTTACTATTT 
TCAAATTCAT 
TAAACGTTTT 
GATTACAGGC 
TGTGCTTGAG 
GCTTGTCCGA 
TGCTATTTTC 
TCATCGCTGA 
ACTGAGTCAG 
CCCTGTCGCT 
TTTATTCTCC 
GAATCTGGCG 
CAGGTGCCTT 
TCAAACCTCT 
CTGTAATAGC 
GCACAGCGTT 
CACGCGCTTC 
TTTCCTTACG 
GTGGTTGCGC 
GGAGGTGCTG 
CATCTACAGT 
CGATGAAGAA 



CTCCTCCAAC 
CAAGCATCTC 
CCTCTTTTTT 
TCTATTAATT 
TT r ITTCAAA 
A ACTTCCTG 
TCGTAATCCG 
TACTTTGGCA 
TTTATCTTCA 
GATATATCTT 
TATTTCTCAT 
CTTGAATGAT 
TTTCGATACT 
CATCCAAGTC 
CTTGGATAAT 
TGAAAATTCA 
AATCGTACTA 
GACACCAAAC 
GAGGCTGTCC 
TGTCTTGAGC 
TCCCGTGAAA 
CACGTTTCCG 
AAGCGGGTCG 
AATGCCACGA 
GGCTGTATTT 
CGGCAATGTT 
TCCAGCAGCC 



ACCTGCTTTG 
AGTATCCATA 
ATTTTTAATA 
TTTTTCATCC 
ATCATCTCTA 
CAAAACTAGA 
AAGCGTTTAC 
AAGATATTCT 
ACTGTTAGCG 
CATGAGCCCT 
TTCTAAAAAC 
TTTAACCACA 
AAATTCATCC 
GTCTTTCAGA 
GGTCACAATA 
CGCGCAGTCT 
TAAACAGCCT 
AAGACCGCTC 
TTGAGAAGGA 
AAGCCCATGA 
CCATCTGCCA 
ATAAAGTTCA 
CCCTTGCTAC 
ACATTCTTGG 
TCTGCATTAG 
GACATAAGTC 
AACAAAGCAC 



GCAAAATCTC 
ATTTCAAATT 
TTTGTCTTGT 
AAATCTTGAA 
TTTTTAAAAA 
ATCCTAGTTC 
GATGATTTCG 
CAACCTTGCT 
GCTTGAGTTT 
TGATAACCAC 
CATCTACTTC 
TCTCCTACAC 
TCCAGCGTCA 
CTCAAGGATT 
CTGTCAAAAA 
GGGCAACTAC 
TGGCATCGCT 
CACCAACATC 
GGGCACCTAG 
TTCCCATAGC 
CAACAACATC 
AACTTTCATC 
TCTCTGTTCC 
CATAGAAAGA 
CACCGAGGTC 
CAGGACGGTC 
CTGTATTCCC 



CCCAATTGCA 
GTGACATCTT 
TACAAACTTG 
TTGGCATCGA 
AGATGTTTTC 
ATGATTGATA 
ATAGGTTGTT 
TCTCTCCTTA 
GCTGGATTTA 
TGTCAGCCAA 
CGTTGACTTG 
TTTGGAGTTG 
AGATAAACTC 
CTGTCACGAC 
TTTCTTTTTC 
TTCTGTTTCT 
TGAGCCATGA 
TGAATAATTG 
TTTCGCTCGA 
TGTCCCTTCG 
TGCAACGCCA 
AGCCGCCAGT 
GTTGTTGAGC 
ACCTAGGACA 
AAGCATGTCA 
GATATTCTTG 
AGCCGAAAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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ACAGCGTCTG CTTCACCATC TTTGACAGCC TTGGCTGCCA ATACCATACT GGCATTTTTC 1680 
TTATTCCGAA TAG 1693 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



CTAAATCTGC 


TTGCTTAGTC 


CACTTGCTTG 


AGCCAAGGAG 


TCACATACTC 


AAAATTTTCA 


60 


TGAAAGTCAT 


AGGTACCGTC 


TTCTTTTTTA 


GCTGAAAAGA 


AAAGTCCATC 


GTGGTAGAGC 


120 


AAATCCCCGT 


GGGCCATAAT 


TCTGGCAGTT 


TTTTCCTCTT 


CCTACTCCTG 


AGACTTTTGC 


180 


TTAGTCCCCT 


CTTGAGAAAT 


AGTATCTCGT 


TTTTGACTAG 


TCAAGGGATT 


CCTTGGAAGC 


240 


TTTCAAACAA 


CAAGACCAAG 


CCCATTGATA 


AACCAACTGC 


TAGCAAGAGT 


ATCGCCACAA 


300 


ATCCCTTATT 


GCTCCACTTG 


CGATAACTCC 


TAAAAAGTTT 


ACCAAGCCCT 


TCATAAAACG 


360 


AAAAGCTAAA 


CCACCCTGAT 


TTCGATTTTG 


TCTTCTTTGT 


ATCTTCGTTC 


TCCCTACTTT 


42' 


CTTATGCAAG 


CCTTTTCTTT 


TTATTATATC 


ACAGATAAGT 


ATTTCTTTCA 


CAATTGAATT 


4 J 


GAACTTCCCA 


TCTATTTTCT 


ATAAATCCTA 


AATGCCATAA 


TGCTTTCAAT 


TCCTGTCATT 


540 


TTGTGATATC 


ATGTAGAAGA 


AATGAACTAA 


TCCACAGTGG 


CTTATTCCAA 


GTATACCACT 


600 


TGGGCTTTGG 


CAGTAG 










616 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CTAATATAGA ATAATCACCG CCGTTGTGAA AGAACGATTG GATGATAATC CAATCGTTCA 
GGGAAATTGG AAGACCTTGG GTTTCCAATT TAGGCATGAG ACACCTTTGG TGGCTGCTGC 
CGTCCCTCAC AAGCTAAGGT GATTGTTGAA AAAGAGGAAA AAGGAGAAGA AATGAAACCA 
GTAATTTCCA TCATCATGGG CTCAAAATCC GACTGGGCAA CCATGCAAAA AACAGCAGAA 
GTCCTAGACC GCTTCGGTGT AGCCTACGAA AAGAAAGTTG TTTCCGCACA CCGTACACCA 
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GACCTCATGT TCAAACATGC AGAAGAAGCC 
GGTGCTGGTG GCGCAGCGCA TTTGCCAGGC 
ATTGGTGTGC CAGTCAAGTC TCGTGCTCTT 
CAGATGCCGG GTGGGGTGCC TGTTGCGACC 
TAAAACAGGG TTCGGATAAG TTTTTTTGCA 
CCTTATTCTA ACTTAGTCTT AAGATAGGTC 
TTGGCAGCTT GTATGAGTAC CTACCGCAGA 
ACTAAATCAA TCTGACCTGA CTGATAAATA 
TGAGCAGGAT TATCAAAGGC ATGAATATTT 
CGATTCTCAA TCCCAGTAAC CGTCGTGATG 
ACACATTTTT CCGCCTTGTC AATGAGCCTC 
ATAAAACGTC TATGCGTTAT CAAACTCATT 
TCGGAAATTG TCAAGCGATT GGAGGAAATG 
ACCACTTGGG CTTTGGCAGT AGCTAACTGC 
AAGACAGTTC AATTTTTTTG GCATTATTTT 
ATCCTGATGA TTGTTCTGGC GACTTTTGCC 
GCGGTGACGC AGCTAGCCAA TTTAGTTCAA 
ATGGCAAAGC CTATCAGGAA TTCATGGTCA 
ATCTCTAGGT GTAATATAAA CATGTGTCTC 
GAGATGCGCA AAGGTCTCTT TGGTAAGCTT 
CGACAAGATG GCGATATCCT GTCTCATTTT 
TTTAACGAAA jCTTGATTCA GGTCATGAGC 
GTCATGTTT. CGAGAAATGT GACGCTGGCT 
TTCCTTATGC TGATTTTCAT CGTGAAAATG 
GAGGTAGGGA AGCTCAACGC CTATATGGAT 
GTGCTAGGAA TTCAAGAGGA TATGATGGCA 
AAGGCAACCT TTAAAGGAAG AATGTTCTCA 
AGCCTGATTA ATACAGCCAT CGTCATCTTT 

(2) INFORMATION FOR SEQ 
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: CGTAGTCGTG 


GCATCAAGAT 


CATCATCGCA 


3 fin 


: ATGGTAGCTG 


CCAAAACAAC 


CCTTCCAGTC 




' AGTGGAGTGG 


ATTCACTCTA 


TTCTATCGTT 


480 


! ATGGCTATCG 


GTGAACTCTT 


TTTTAGGATA 


540 


> AGGTGGATGA 


TGGCTACATT 


GTAATGTTTT 


600 


TTAAAACCAG 


GTGAAAAGCG 


AAGGCATGCT 


660 


> TGAAGGGAAC 


CCCGTTTGAC 


CATCCTTCCA 




. GAAGAATCCA 


GTCCAGCGAA 


AGCTTGTAAT 


/ OU 


CGAATCTCGG 


CTAAAATGAC 




QAf\ 


ACCGAGTTTA 


ACTCAGCCAT 


CAAGTCATTG 




TTGTAATGTT 


TGATGTTTTC 


ATTACACGAG 


or n 


ACCAATTAAA 


ACAAATGTGG 


TTAGATCCTT 




AACTAATCCA 


CAGCGGCTTA 


TTCCAAGTAT 


±\Ja\J 


GCTAAATATA ATATAAGGAG GAGTAAAATG 




AAGGTCTACA 


AGTTCTCATT 


TGTAGTTGTC 




CAAGCCCTCT 


TTCCAGTCTA 


TTCTGGACAA 


1 *1 C A 


GCTTATCAAA 


ATGGGCAATC 


CAGAACTTGT 


lion 


ATCTTGGCCT 


GCTGGTTTTG 


GGTTCTATTT 


1 1 fin 


ATGACGCGCG 


TGATTGCAGA 


ATCGACCAAC 


A it ft U 


GCTCAGTTGA 


CGGTTTCTTT 


CTTTGACCGT 


X jUU 


ACCAGTGATT 


TGGATAATAT 


CCTCCAAGCC 


XD D u 


AATATTGTTT 


TATACATTGG 


TCTGATTCTT 




CTCATCACCA 


TTGCCAGCAC 


CCCATTGGCT 


i con 


GCACGTAAAT 


ACACCAACCT 


CCAGCAGAAA 


1740 


GAGAGCATCT 


CAGGCCAAAA 


AGCCGTGATT 


1800 


GGATTTCTTG 


AACAAAATGA 


GCGCGTGCGC 


1860 


GGAATTCTTT 


TCCCTGTCAT 


GAATGGGATG 


1920 


GCTGGTTCGG 


CTGTACTTTT 


GAA 


1973 



ID NO:10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CTAAACCAAT AGTTGCTGAA GACCATAATC CAGCTGCTGT TGTTAAACCT TTAACTTTTT 
GATTAGTTAC CATGATTGTT CCTGCCCCTA GGAAACCCAA GCCACTAATT ACTTGGGCTC 
CCATTCGACT AGGATCACCG CTACCATAAC GACTAGTGAT GAACTGATTT GTCATCATTA 
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CAACACAAGT TCCCAAACAA ACTAGTAAGT 
CTCCTCGCTC ATAGCCAACA ATGCCACCGA 
TTTCCCAAAT ACTCAGTTCG TATGAAAGAT 
AATAGGCTTG ATAAATAAAG AGCTGCACTA 
AACATCATTG CCTGTGCATT TAAAGTTGCT 
CCCAATGGTT GGAAAAGTGG ATGGTACAAG 
GAATTAAAGT TTAACACAAC TACTGATAAT 
CGCACCATAG TGTAGAAACT AGATGCACCC 
TCTATACTGA AAAATACAGC CCGAATCATT 
TATGCAATAA GTAAAATAAT TACTGTTCCT 
GGTATATTAT AAGTGAACAT TAATCCTAGT 
ATCAGTGCAC CATACTCAAA GAATTTATCG 
GAAATTACTA TTGCTATAAT TGTAGCAATT 
ACTAGGTAAG GACGAAATGA TTGAGCATCT 
AATTTAGATA GATCTAAAGT TCCCGTCTTG 
ATTATCAAAA CTATTGGTAA CATATAGATT 
AGAATTATTC CATAATGGAG AAGAAAATTA 
AAACAGAAT 



PCT/US97/05306 

AGGTTCTAAT CCCTGCTGCT TGGTTTTTGA 240 

AAAGAATCGC TAAAAAGCAC CTAAGAAGTA 300 

TCATATTATC TCTTACCTCG TTTACCTTGG 360 

GACATAATCA TTAAAATTAC AGAATAAACA 420 

GTTTCATCAG TAGACTGTTT AATAACGATT 480 

AATACAGATA AGTCATAGTC AGATAATAAA 540 

ACGACTGGTA AAATATACGG TATAATAACT 600 

ATACTACGTG CAGCTTCTTC CATATCATTA 660 

CTATAAGAAA ATGGTAATTT TTGAATAGTA 720 

ACTAAAACTA AATTAAATAG TATTAAATGA 780 

GCAATCAATG TACCTGGCAA TATCCATGGT 840 

AACTTACTCT TGTTTTTATG TACAATACGT 900 

ACCGCAGCTA AAATTGCATA AACAACGCTG 960 

GTAAATAAAT TAGCATAGTT CGCTAACGTA 1020 

ATCGTCAACG AGTCTGTAAA CGAGTATAGA 1080 

GCAAACAATA CATACGCAGC AATGTGAGCA 1140 

TCTGCTTCCT TAATACTAGC CTGGTTTTCG 1200 

1209 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CTAAGTCCTT 


TTAGTTTTAT 


CTTAATTCTC 


TTATTGTTGT 


AATAATCAAT 


ATAGTCTATA 


60 


ATGGCTTGTT 


CCAATTGCTT 


AAGCGACTGA 


AACGACTTCT 


CATAACCGTA 


AAACATTTCC 


120 


GATTTCAGAA 


TCCCAAAGAA 


AGATTCCATC 


ATACCGTTGT 


CTTGGCTGTT 


TCCCTTGCGT 


180 


GACATAGATG 


CTTGAATTCC 


CTTACTCTTA 


GGAACCGATG 


ATAAGAATCG 


TGTTGGTATT 


240 


GCCAGCCTTG 


GTCACTATGG 


AGAATCGTAT 


TCTCGTAGTG 


CTTCTCTGTG 


AATGCCTGTT 


300 


CCAACATTGT 


TTGTACTTGT 


TCTAAGTTGG 


GTGAAGTTGA 


AAGATTATAG 


GCGATAATTT 


360 


CGCTATTAAA 


GCCATCTAAA 


ACTGGTGATA 


AGTAAAGCTT 


TTGAGTACTT 


GCTGGAATGG 


420 


CAAATTCTGT 


CACATCTGTG 


TAGCACTTTT 


CCATTGTTTT 


AGAGCCTTCA 


AATTGGCCTT 


480 


GAATGAGATT 


CTCTGCCTTC 


TTACCAACGT 


CTCCTTTATG 


AGAAGATTTC 


GTTTCTGTCG 


540 


CATTTTAGCT 


TGTAAATTGA 


GTACTTTCAT 


CAAGCCTTGA 


ACTCTTTTAT 


GATTTACCAG 


600 


ATAAGCACGA 


TTCCTTAGTT 


CTAAATGAAT 


ACAGCGATAA 


GCATAATTTC 


CCTTGTGTTC 


660 


GATAAAAATG 


GATTGAATTT 


CAGCTTTAAG 


CTCTTGGTCC 


TTATCTGGTT 


TGTNTAG 


717 



60 
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(2) INFORMATION FOR SEQ ID NO: 12: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTAAATCACC AGATTTAACC AAGTCGTAGA TACGGCCTGG TGTTCCAATA ACAATATGAG 
GCTGATTGCT TGCCAATTTC rCAA^C GAGCCTTATC CGTACCACCC 
CCACACGAAC TTCGACATCT GAGTGAGCTG AAATCTGACG CGCTACTTGG tZ^ G 

ssss sszz ™- 

ATTGCTGGAA AATCGGTAAC AAGAAAGTAT GAGTCTTACC TGAACCTGTT TTTGATTCTC 
CTACTAGGTC ACGACCTGCC AAAACAATAG GAATCAACTT GTCTTGCACC TCTGT^G 
TTGTAAATTT TAACTCCTCC AAGGCTTCTC TAATATAGTT TTTAAATTGA A^TTTCGTAA 
ATGACATAAC ATCCTCGATT CTATCTATCT TATCAATTAT ACCATATTTT ATTCCATTAC 

"™ 1™ CTAmCCAG TAGC,TCTCT AGTAAGAAAA 
TTATAGTTCC AACCTCTTTT CAGTTATTAT TTCCAGTTTA ACATAGCATT CAAGCCATAG 

TGATCACTCA CTTGTGGACT C^TTACCA TCAAATACGA CATGTAAATT ^CA^GCT 
AACTCTTTGG TAGTAAAGAC ATAATCGATT CGAAGGGGTT CAGTGTTCCC T^C^Z 
TCAATTTCAG GCGGAACAGT ATAGCTACCA CTTTTCTCTT GAGCAACTTC AAA^t" 
AAAATCTCCA ^ T ^ At ^' A{ ^' '^'^AAATAGCT TGGTAACCTT CCCTGACCTG CTGGGTTGTT 8*0 

851 



(2) 



INFORMATION FOR SEQ ID NO: 13; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CTGTTGAGAT TGTTACGAAA TAACTGAAAA TATTTAAGGA GAAAATATAT GTTGAAACAC 
TTAAACTTAA AAGGTCACTT ATTGACAGCC ATTTCCTATA TG^C^T 
GCAGGATTCT TAGTTGCCAT TGGTTTAGCA ATGGGGGGTG GTGTTCCTGA C^^A 
OCAGGAAAAT TCACTATCTG GGATGCTTTA GCAACTATGG G^GTAAa" ££££ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
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TTGCCAGTTG TTATTGCTAC AGGTTTGTCT 
CCAGGTTTTG TTGTTGGTCT AATTGCCAAT 
TTGGGAGGTT ATATAGTTGG TTTCTTGGTT 
AACTGGATTA AAGGTTTAAT GCCAACCTTG 
GTTTGATTAT GATTTATATT ATTGGGGCGC 
G 
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TACTCGATTG CTGGTAAGCC AGGGATTGCA 300 

TCTGTTGGTT CAGGGTTTAT CGGTGGTATC 360 

CAAGCGATTA TTAAAAAGGT CAAAGTACCA 420 

ATTATTCCTT TTGTACCTCT TTGGTAAGTA 480 

CTATCGCAGC CTTTACCAAC TGGTTGACGA 540 

541 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTGGAATGAG 


TGTATAAGCC 


CAGCCCAAGT 


TTTGCATCCG 


TTCAAAGTTC 


CAAGACCCTT 


60 


GTAAGAAGGT 


TGAACGCCAC 


CAAACTTTTT 


TACGATCTGA 


TTTAGTTAAT 


TGAAGTTTTT 


120 


CAGTCATGAT 


GTTTTCAGTC 


CTTTCTTATC 


TTAGTAGTCT 


TC . AGGATAT 


CGCCGATTGG 


180 


GTCGTTAGAA 


GTTGCGGCTC 


CTCCGCCACC 


ATTTCCACCA 


C-TTTAGAAA 


GGTGAAGGTA 


240 


GATAAGAGCG 


ATAGCAACAC 


CGATAGCACC 


GAATCCGATT 


AGAGTAATAT 


CTGACACAGC 


300 


AGCGAGAACG 


AAACCAAGAG 


CGAAGAATGG 


CCATACTTCA 


CGAGTTGCCA 


TCATGTTGAT 


360 


AACCATGGCG 


TAACCAACGG 


CAACGACCAT 


ACCACCACCG 


ATAGCCATAC 


CATCTTTGAG 


420 


CCAGTCTGGC 


ATGGCACTAA 


GGATACTTTG 


TACAGTTTCA 


GTTGGTACCA 


TAAGGAGAAG 


480 


AGCTGCAGGA 


AGCGCGATAC 


GAAGTCCTTG 


GAAAAGTAGC 


GCAATGAAAT 


GCGCACGCTC 


540 


CACAGCGCCG 


AAGTCACCTT 


TTTTAGCGGC 


AGCATCTGCA 


GTATGAACCA 


AACCAACTGA 


600 


AATTGTACGA 


ACAATCATTG 


TCAAGAAAAG 


TCCAGCTACG 


GCAAGAGGGA 


TAGCAACCGC 


660 


TTGGGCAACA 


CCGATACCAG 


TCTTGGTAAA 


GTCACCACCA 


AGAACCATGA 


TAATGGCAGC 


720 


AGCGACAGAA 


GCAAGTGCAG 


CATCAGGAGC 


GATAGCAGCA 


CCGATATTTG 


ACCAACCAAG 


780 


GGCAATCATT 


TGAAGCGATC 


CACCGAGGAT 


AATCCCTGCT 


TCCAAGTGGA 


CCTGTTACAA 


840 


GCCCAATAAG 


GGTACAGGCT 


ACAAGTGGTT 


GGTGAAATTG 


GAACTGGTCG 


AGGATGCCTT 


900 


CAAGACCTGC 


AAAGGAAGGC 


TACAACGACT 


ACTAAAACCA 


TAGAAATAAT 


AGACATGTTT 


960 


AAAATCCTTT 


CATAAATAAT 


GGCTTATTTG 


ACATTGGCTT 


TGTTAATCAA 


GTCAAACAAA 


1020 


TCTTTTTTAG 


AATCATTTGG 


TACTTTACGG 


ACATCAAATT 


CAACACCCAA 


GTCACGCATT 


1080 


TTTTCAAATG 


TAGCAACATC 


TTCTTTGTCC 


ATAGACAAAA 


CGGTATTGAC 


CAATGTTTTA 


1140 


CCTGTTGAGT 


GAGCCATAGA 


ACCAACGTTA 


AGAGTCTTGA 


TTGGCACGCC 


GCCTTCGATG 


1200 


GCACGAAGGG 


CATCTTGAGG 


TGTTTCAAAC 


AAGATAAGGG 


CATGTGTTTC 


TCCAAAACGT 


1260 


GGGTCTTTTG 


AAATATCAAT 


CAGTTTTTGA 


ATTGGAACCA 


CGTTAGCCTT 


GACATTACCT 


1320 


GGAG 












1324 



(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCATAAGGAA GCTGTCGCTC GTTCCGCTAA GGTATGGACA CCACGGTGAA CATTGGCATT 
GTCCTGCTCA TAGTAACTGT TAATAGCTTT CAGAACTACT AGTGGTTTTT GTGTCGTCGC 
AGCATTGTCC AGATAGACCA GAGGTTCATC ATTGACAATC TGATCTAAAA TTGGAAAATC 
CTTGCGAATC GCTTCTACAT CTAACATAGG CTTCCCCTTA GCGTTTTGAC AATTTCTCTT 
CGATAGTTGC AATCATTTCA TCACGAACTT CCTTGACTGG AATCTCCACG ATAACAGATC 
CAAGGAAACC ACGAACAACC AAACGCTCTG CAGTTGCCTT ATCCAATCCA CGACTCATGA 
GGTAATACAT GTCTTCTGGA TCAACTTGTC CGATAGACGC TGCGTGTCCT GCAGTGACAT 
CATTTTCATC AATCAAAAGA ATTGGGTTAG CATCTGAACG CGCTTGGTCT GAAAGCATGA 
GAACACGGCT CTCTTGTTGC GCATCTGCTC CCTTAGCACC CTTGATGATG TGGCCGATAC 
CATTGAAAGT CAAAGTTGCT TTTTCAAGGA TAACCCCATG TTGTAGGATA TTTCCGATAG 
AGTTGCAGCC ATAGTTAGTT ACACGAGTAT CAATCCCTTG TACCTGACGA CCACTTGAAA 
GAG 

(2) INFORMATION FOR SEQ ID NO; 16: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
663 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 649 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



A^ZT GGCTTCAATA ^^TT TGGCrcCTCC ACTATTGGCA ATGAGCTGAA 
AACAGATATA TTCCATTTCT TCGTCATCTT ATTTCTCCTA TCCATTCAAG TGCTTGTTCC 
AGAACTTTTG CTCCATTCAT CATTCCGTAA TCCCGCATAT CAATGGTATC TACAGGGATA 
TTTCCTGCAA TTTCTTTCAC AGCAAGTAAC TCATAACGAA TTTGTGGCCC AATTAGAATG 
ACATCTGCTT CATGGATATT CTTTTTAGCT TCTGTCATTG ATTTTGCTTG GATAGAAATT 
TCAATCCCAC GTTCAGTCGC ACTTTGTTGC ATTTTTTTAA CAAGCATACT TGTCGACATT 
CCCGCATTAC ATACTAATAA AATTTGTTTC ATAATCTTAA CCTTCCATTT CTTGTTCAAC 
AACTTTGTCA TTAACTTTGA TAAATGGAAT GTATAGAAGA ACTCCAAGTG CAAAGATGAT 



60 
120 
180 
240 
300 
360 
420 
480 
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GAATTGAACT AGAACTGCTC TCACGTCCCC TGCTGTTGCT AACCATGCAT TTAAGAATAC 540 

TGGTGTAGTC CAAGGAACTT GTATAAATGC AGGACTCATG AATCCTGTAA CTGTTGCTAA 600 

GTAGCTGATT AAAATACCAA GGACTGGAAC TGTGATAAAT GGAATAGTC 649 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CTTGGTATTA 


TTTGAAATCA 


GATGGTTCTT 


ATGCAAAAAA 


TGCATGGCAA 


GGAGCTTATT 


60 


ACCTTAAATC 


AAACGGTAAA 


ATGGTACAAG 


GTGAGTGGGT 


TTATGATTCT 


TCTTACCAAG 


120 


CCATGGTATT 


ACTTGAAATC 


AGATGGTTCA 


TATGCTCGCA 


ATGCATGGCA 


AGGAAACTAC 


180 


TATTTGAAAT 


CAGATGGTAA 


AATGGCTGTC 


AATGAATGGG 


TTTATGATGC 


CACCTATCAA 


240 


GCATGGTATT 


ATTTGACATC 


AGATGGTTCT 


TATGCTTACA 


GTACATGGCA 


AGGAAATTAC 


300 


TATCCTAAAA 


TCGGATGGTA 


AAATGGCTGT 


r .ATGAATGG 


GTTGATGGTG 


GACGTTATTA 


360 


TGTTGGCGCT 


GACGGAGTTT 


GGAAGGAAGG 


TCAAGCAAGT 


ACAGCTTCTC 


CTAGTAATGA 


420 


TAGCAATAGT 


GAATATTCCT 


GCTGCTTTAG 


GAAAGGCAAA 


AAGTTATAAT 


TCGTTATTCC 


480 


ACATGTCAAA 


AAAAACG 










497 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CTAGTCTAAA 


TTTTTTAAAA 


CAAAGGTCAA 


AGATAGTCAA 


TATCAGTAAT 


CATAACTAAG 


60 


TAAACAAAAA 


GAGGTAAAGA 


ATATGAATAA 


CAACTTTAAT 


AATTTTAATA 


ACATGGATGA 


120 


TTTATTTAAC 


CAATTGATGG 


GTGGTATGCG 


AGGATACAGT 


TCTGAAAATC 


GCCGTTACTT 


180 


AATTAATGGA 


CGCGAAGTCA 


CACCTGAGGA 


ATTTGCTCAC 


TATCGTACGA 


CTGGTCAATT 


240 


ACCAGGAAAT 


GCAGAAACTG 


ATGTGCAAAT 


GCCACAACAG 


GCATCAGGTA 


TGAAACAAGA 


300 


CGGTGTCCTT 


GCAAAACTAG 


GTCGAAACTT 


GACAGCAGAA 


GCGCGTGAGG 


GCAAGTTGGA 


360 
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TCCTGTTATC GGACGAAACA AGGAAATTCA AGAAACATCT GAAATCCTCT CACGCCGCAC 
CAAGAACAAT CCTGTTTTGG TCGGAGATGC AGGTGTTGGT AAGACAGCAG TTGTCGAAGG 480 
TCTAGCGCAA GCCATTGTGA ACGGAGATGT TCCTGCTGCT ATCAAGAACA AGGAAATTAT 540 
TTCTATTGAT ATCTCAGGTC TTGAGGCTGG TACTCAATAC CGTGGTAGCT TTGAAGAAAA 
TGTCCAAAAC TTAGTCAATG AAGTGAAAGA AGCAGGGAAT ATTATCCTCT TCTTTGATGA 
AATTCACCAA ATTCTTGGTG CTGGTAGCAC TTGTGGAGAC AGTGGTTCTA AAGGGCTTGC 
GGATATTCTC AGCCAATCGA TCTCTCTCGT GGAGAATTGA CAGTGATTGG GGCAACAACT 
CAAGACGAAT ACCGTAACAC CATCTTGAAG AATGCTGCTC TTGCTCGTCG TTTCAACGAA 
GTGAAGGTCA ATGCTCCTTC AGCAGAGAAT ACTTTTAAAA TTCTTCAAGG CATTCGTGAC 
CTCTATCAAC AACACCACAA TGTCATCTTG CCAGACGAAG TCTTGAAAGC AGCGGTGGAT „ u 
TATTCTGTTC AATACATTCC TCAACGTAGC TTGCCAGATA AGGCTATTGA CCTTGTCGAT 1020 
GTAACGGCTG CTCACTTGGC GGCTCAACAT CCAGTAACAG ATGTGCATGC TGTTGAACGA 1080 
GAAATCGAAA CGGAAAAAGA CAAGCAAGAA AAAGCAGTTG AAGCAGAAGA TTTTGAAGCA 1140 
GCTCTAAACT ATAAAACACG CATTGCAGAA TTGGAAAGGA AAATCGAAAA CCACACAGAA 
GATATGAAAG TGACTGCAAG TGTCAACGAT GTGGCTGAAT CTCTGGAACG AATGACAGGT 
ATCCCAGTAT CGCAAATGGG AG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



420 



600 
660 
720 
780 
840 
900 
960 



1200 
1260 
1282 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT ATCTATTCCT CCACTAGTAA 
ACGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG CCAATATTTA ACTTTATCTT 
TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA ATGAATCATT CTTCCTATCA 
ATCCATCCAT GATATTTTCT TGCTAAATAT AATGGAGTCT CATAGTGTGA AAGAGTTACA 
AGTGGTTCTA TCCCGTGAGC ATGTAGTTCA TCAAACAATT CATCATAATA TTTCAATCCA 
GCTTCGTTAG GTTCTTCCTC ATCTCCTTTT GGAAAAATTC TACTCCATGC AATAGAAGTA 
CGAAAAACAT TAAAGCCCAT TTCAGAAAAC AAGGATATAT CTTCCTTATA TTTATGATAA 
AAATCAATAC CTATCAATTT TAAGTTATCT TCTGTAGGAT TTTCTGTTGC TTCTCCTAAT 
CCACCTTTGG GTAACACATC CTGAACTGAT AAGCCCTTAC CATCTTCATT ATATGCTCCC 
TCTACTTGAT TAGCTGCAAC AGCTCCACCC CAAAGAAAAT CATCTGGAAA AATGGTCATA 
ACTTTCCTCC ATTATAATAT TACCAGTAAT TCCTTAGAAA TGCTCGATTG TCTGATTATT 
AGGTAATATT AATACATCTA GAAAATCATT GGTATTCGTT ACAATTACTG GTGTAACTGT 
TTCGTAGCCT TTAGTCTTGA TTAAATTCAA GTCCATTTCA AAAATCAACT GATTTTTGAA 
AACTCTGTCT CCTTCTTCTA CATGACTAAT AAAACCTTGA CCTTTTAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
828 
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(2) INFORMATION FOR SEQ ID NO: 20: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AGCTATGGTC 


AGACTCAGAT 


TGATGGCGTT 


GCTTATGCCA 


AGTACGATAT 


CTTCCGTTTA 


60 


AAGAACGGGA 


AAATTGTGGA 


GCATTGGGAT 


AATAAGGAAG 


TCATGCCTAA 


GGTAGAAGAC 


120 


TTGACCAATC 


GAGGGAAGTT 


TTAAATTGAG 


GACAAAGAAT 


GATTGAATAC 


AAAAATGTAG 


180 


CACTGCGCTA 


CACAGAAAAG 


GATGTCTTGA 


GAGATGTCAA 


CTTACAGATT 


GAGGATGGGG 


240 


AATTTATGGT 


TTTAGTAGGG 


CCTTCTGGGT 


CAGGTAAGAC 


GACCATGCTC 


AAGATGATTA 


300 


ACCGTCTTTT 


GGAACCAACT 


GATGGAAATA 


TTTATATGGA 


TGGGAAGCGC 


ATCAAAGACT 


360 


ATGATGAGCG 


TGAACTTCGT 


CTTTCTACTG 


GTTATGTTTT 


ACAGGCTATT 


GCTCTTTTTC 


420 


CAAATCTAAC 


AGTTGCGGAA 


AATATTGCTC 


TCATTCCTGA 


AATGAAGGGG 


TGGAGCAAGG 


480 


AAGAAATTAC 


GAAGAAAACA 


GAAGAACTTT 


TGGCTAAGGT 


TGGTTTACCA 


GTAGCCGAGT 


540 


ATGGGCATCG 


CTTACCTAC I 


GAATTATCTG 


GTGGAGAACA 


GCAACGGGTC 


GGTATTGTCC 


600 


GAGCTATGAT 


TGGTCAGCCC 


AAGATTTTCC 


TCATGGATGA 


ACCCTTTTCG 


GCCTTGGATG 


660 


CTATTTCGAG 


AAAACAGTTG 


CAGGTTCTGA 


CAAAAGAATT 


GCATAAAGAG 


TTTGGGATGA 


720 


CAACGATTTT 


TGTAACCCAT 


GATACGGATG 


AAGCCTTGAA 


GTTGGCGGAC 


CGTATTGCTG 


780 


TCTTGCAGGA 


TGGAGAAATT 


CGCCAGGTAG 


CGAATCCCGA 


GACAATTTTA 


AAAGTGCCTG 


840 


CAACAGACTT 


TGTAGCAGAC 


TTGTTTGGAG 


GTAGTGTTCA 


TGACTAATTT 


AATTGCAACT 


900 


TTTCAGGATC 


GTTTTAGTGA 


TTGGTTGACA 


GCTACAATGA 


CATTGGTCGG 


TTCCTTGAGC 


960 


AAGAGATAGA 


TTAGCCAGAC 


AGTCATGCCC 


AAAATCCCTC 


CAGGTAAGAG 


CATAGACCGT 


1020 


TGCACATTAA 


GTACGATTAA 


AAAAGTGATA 


ATGGCAAGAA 


AACTTGCTAC 


TGCTTGTAAT 


1080 


AAAAAGGTTG 


TTAGTGTCAT 


ATTAGTTCAT 


CAATACCAAG 


GCGACAGAAG 


TTCCTGCCCC 


1140 


TAAAGCGAGG 


GTAATGAGCA 


GGGATTCAAA 


CATCTTACTC 


ATACCAGAGT 


TTATGTGGTT 


1200 


GGTCATAATA 


TCACGGACCG 


CATTGGTCAA 


GGCAATACCT 


GGTACAAACG 


GCATGACCGC 


1260 


ACCAGCTATA 


ATCAAATCCT 


GCCCGTTTGA 


ATGGAAAAAC 


CCTGTGTTAG 


CCGAGCCCCA 


1320 


AAACTGGGGC 


CAATTTATCC 


CCCAAAGACA 


AAAGCTCCAT 


CAAAGGCTGT 


CACAAAGGGA 


1380 


ATTCGGATAA 


ATTTTCCACA 


TAGAAGGAAA 


AGGCAAAACC 


AAATAAGGTC 


GCCACTCCTG 


1440 


CCCCAAGTGC 


TCGTAAATAT 


TCCGCT 








1466 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 97/37026 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



AAGGTTCTTC 
AATCTGCACA 
GTCGTGAGTC 
GATGGGACTC 
CATCGCATTA 
CAAATATAAC 
AACGTGGCTA 
CAGGAGCAGC 
CAGGTGTTCG 
CGTACGTTGA 
CTGGTGACAT 
CAATTGGTGA 
TACCTGCTGA 
TTTATACACC 
ATGGTTTCGA 
CTTTAGGGCT 
AGATTGCTAA 
TAGATCGTAC 
CCGGTATCAA 
GAAATCTTGG 
GATTAAAAGA 



GTAAATGTAC 
AGAACACGGT 
AGCTATTTCG 
CAGTGCCAAC 
CACTGCTTTT 
AAAAATTGAT 
CGGTACAGCT 
TGTGACATCT 
TGAAGACGTG 
AGACGAAAAA 
TTTGACAGAT 
AGGTTCTTCT 
TGAAAATAAA 
AGTTACAAAA 
CAAATTAACC 
TTCAGCACGT 
GTCAACTGAA 
GATTGAGGAA 
TACTGTGCAT 
ACGCAAGAGT 
TAAATAAAGG 



ACCATTCGGA 
CTTAAATCAG 
TGCGCTTGCT 
ACAATGGTGC 
CGTTTAAGAG 
GAAAATAAAG 
CTTGGTAACT 
ATCAACATTG 
ATGCAAATCA 
ATCATCGAAC 
AGCGATATTG 
CTAAAAGCGA 
AAGGATAATG 
GTCAACTATC 
CTTGAAATCT 
ATTTTGACAG 
GTGATGAAAG 
CTGGACTTGT 
GATTTGACAG 
TTGGAAGAAG 
AGGAATACAT 



TGCTCAAAAT 
TTGAAGTTAC 
GCCGCTGGTC 
TTCGTCCTCC 
GGAGTAACTA 
ATTATGGCAA 
CTCTTCGTCG 
ATGGTGTGTT 
TTCTGAACAT 
TGGATGTTGA 
AAATTGTAAA 
CTATGACTGT 
CACCAGTTGG 
AAGTGGAACC 
TGACAAATGG 
AACATCTTGA 
AAGCTGATAC 
CTGTGCGTTC 
AAAAATCTGA 
TGAAACTCAA 
GGCTTACCGT 



GGGTTCTGAA 
TGTAAAAGGT 
TTGAAGTAAC 
AAAACGTCGC 
AATGATCGAG 
GTTAGTAATC 
TGTACTTCTA 
ACATGAGTTT 
TAAAGGAATT 
AGGTCCTGCT 
TCCAGATCAT 
TAACAGTGGT 
AACACTTGCT 
TGCTCGTGTA 
AACAATTATT 
TTTGTTTACA 
TGAATCTGAC 
ATACAACTGT 
AGCAGAGATG 
ACTCATTGAT 
AAACTAGGAC 



GCTGCTGCTA 
CCAGGTTCTG 
AGCAATTCGT 
CGTGTATAAT 
TTTGAAAAAC 
GAACCACTTG 
GCTTCTCTAC 
GACACAGTTC 
GCAGTGAAAT 
GAAGTAACAG 
TATCTCTTTA 
CGTGGATATG 
GTAGATTCTA 
GGTAGCAATG 
CCAGAAGATG 
AATCTTACTG 
GACCGTATTT 
TTAAAACGTG 
ATGAAAGTAC 
TTGGGTCTTG 
GCACTAG 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1257 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTATTGAAAC AAGAAAAATA GAGAATCAAA GAAAGAGAAC TTATGAATAT TCAAGAAGAA 
ATTAAGAAAC GTCGTACCTT TGCCATTATC TCCCACCCGG ACGCGGGGAA AACAACCATC 
AC^GCAGT TACTCTAACT TTGGGGGTGA GATTCGTGAG GCTGGTACGG TAAAAGGGAA 
GAAAACAGGG ACTTTTGCTA AATCTGACTG GATGGATATC GAGAAGCAAC GTGGGATTTC 
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TGTTACTTCA 


TCTGTTATGC 


AATTTGACTA 


CGACGGCAAG 


CGCGTGAATA 


TCTTAGACAC 


300 


GCCAGGGCAC 


GAGGACTTCT 


CAGAAGATAC 


CTATCGTACC 


TTGATGGCGG 


TGGATGCTGC 


360 


GGTCATGGTC 


GTGGACTCTG 


CCAAGGGGAT 


CGAGGCTCAA 


ACAAAGAAAT 


TGTTTGAGGT 


420 


TGTGAAACAT 


CGTGGCATTC 


CAGTCTTTAC 


CTTTATGAAC 


AAGTTGGATC 


GTGACGGTCG 


480 


TGAGCCTTTG 


GATCTCTTGC 


AAGAATTGGA 


AGAAATCTTG 


GGCATTGCTA 


GCTACCCTAT 


540 


GAACTGGCCT 


ATCGGGATGG 


GGAAAGCCTT 


TGAGGGCTTG 


TATGACCTCT 


ATAACCAACG 


600 


TTTAGAGCTT 


TACAAAGGGG 


ATGAGCGTTT 


TGCTAGCCCT 


AGAAGATGGA 


GACAAACTTT 


660 


TTGGTAGCAA 


TCCTTTCTAC 


GAGCAAGTCA 


AGGATGACAT 


TGAGCTTTTA 


AATGAAGCTG 


720 


GGAATGAGTT 


TTCAGAGGAA 


GCTATTCTGG 


CTGGAGAATT 


GACGCCTGTC 


TTTTTCGGTT 


780 


CAGCCCTGAC 


AAACTTTGGT 


GTGCAGACCT 


TCCTTGAAAT 


CTTCCTCAAG 


TTTGCTCCAG 


840 


AACCACATGG 


TCACAAGAAA 


ACAGACGGTG 


AAATTGTGGA 


TCCTTATGAC 


AAGGATTTCT 


900 


CAGGCTTTGT 


CTTTAAAATC 


CAAGCCAACA 


TGGATCCTCG 


TCACCGTGAC 


CGTATTGCCT 


960 


TTGTCCGTAT 


CGTATCAGGC 


GAATTTGAGC 


GTGGCATGAG 


TGTCAATCTC 


CCTCGTACTG 


1020 


GTAAGGGTGC 


CAAACTATCT 


AATGTTACCC 


AGTTTATGGC 


GGAGAGTCGT 


GAGAATGTGA 


1080 


CCAATGCCGT 


AGCAGGTGAT 


ATTATCGGGG 


TTTACGATAC 


CGGTACTTAT 


CAGGTTGGGG 


1140 


ATACCTTGAC 


GGTTGGAAAA 


AACAAGTTTG 


AATTTGAACC 


ACTGCCAACC 


TTTACTCCTG 


1200 


AAATTTTCAT 


GAAAGTTTCT 


GCTAAGAATG 


TTATGAAGCA 


AAAATCCTTC 


CACAAGGGGA 


1260 


TTGAGCAATT 


GGTGCAAGAA 


GGAGCCGTTC 


AGCTTTATAA 


GAATTACCAA 


ACAGGTGAGT 


1320 


ACATGCTGGG 


AG 










1332 



'2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



TTGGACGCTG 


GAACAAATCC 


AAGCTGACCT 


GTTCCAAGAC 


CAGACTTGGT 


ATGCTCTGGC 


60 


TTATGATGGG 


GCAGAAGTGA 


TTGGCTTTCT 


AGCTGTTCAG 


GAGACTCTTC 


TTTGAAGCAG 


120 


AAGTCCTGCA 


AATCGCTGTC 


AAAGGAGCCT 


ATCAGGGTAA 


GGGCATTGCG 


TCAGCCTTGT 


180 


TTGCTCAATT 


GCCGACAGAC 


AAGGAAATTT 


TCCTCGAAGT 


CAGACAGTCA 


AATCAACGAG 


240 


CGCAAGCATT 


TTACAAGAAA 


GAAAAGATGG 


CAGTTATCGC 


TGAGCGAAAG 


GCCTACTACC 


300 


ATGACCCAGT 


CGAGGACGCC 


ATTATCATGA 


AGAGAGAAAT 


AGATGAAGGA 


TAGATATATT 


360 


TTAGCATTTG 


AGACATCCTG 


TGATGAGACC 


AGTGTCGCCG 


TCTTGAAAAA 


CGACGATGAG 


420 


CTCTTGTCCA 


ATGTCATTGC 


TAGTCAAATT 


GAGAGTCACA 


AACGTTTTGG 


TGGCGTAGTG 


480 


CCCGAAGTAG 


CCAGTCGTCA 


CCATGTCGAG 


GTCATTACAG 


CCTGTATCGA 


GGAGGCATTG 


540 


GCAGAAGCAG 


GGATTACCGA 


AGAGGACGTG 


ACAGCTGTTG 


CGGTTACCTA 


CGGACCAGGC 


600 


TTGGTCGGAG 


CCTTGCTAGT 


TGGTTTGTCA 


GCCGCCAAGG 


CCTTTGCTTG 


GGCTCACGGA 


660 


CTTCCACTGA 


TTCCTGTTAA 


TCACATGGCT 


GGGCACCTCA 


TGGCAGCTCA 


GAGTGTGGAG 


720 
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CCTTTGGAGT TTCCCTTGCT AGCCCTTTTA GTCAGTGGTG GGCACACAGA GTTGGTCTAT 780 

GTTTCTGAGG CTGGCGATTA CAAGATTGTT GGAGAGACAC GAGACGATGC AGTTGGGGAG 840 

GCTTATGACA AGGTCGGTCG TGTCATGGGC TTGACCTATC CTGCAGGTCG TGAGATTGAC 900 

GAGCTGGCTC NTCTNGGGCA GGANATTTAT GA 932 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1158 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 24: 

CTAAACTTTC GCTCATAGGC ATACAAATTA ATCCTTTGGC ATAAGTAGCC ATAAAATTAA 60 

CATTTTCTGT TGTAGCTGCT TGTGCAGAAC AAATTAAGTC TCCTTCATTT TCTCTATCCT 12 0 

TGTCGTCTAT AACAAGAACA AGTCGTCCCT TCTGCAATGC TTCTAATGCT TCTTGTATTT 180 

TTCGATATTC CATTGACTGA TTATCCTTTC TGCTAAAATC CATTTTGATA TAATAGTTCC 240 

TTCGATATTT CTGATTTTGG AGAGTTATCC ATCAGTTTTT GCACATATT" ACCTAAGATA 300 

TCATTTTCAA GATTTACTGT ACTCCCGACT TGTTTACTCT TAAGAATGGT TTGTTTCAAG 360 

GTATGAGGGA TAACAGATAC TGAAAAGTTT ACTTTGGAGA CTTTAGCGAC AGTCAGACTA 420 

ATGCCGTCAA TTGTAATAGA TCCTTTTTCA ACTATTAAAT CTAAAATTTC TTTTTGTGTG 480 

TTGATTTGAT ACCATACAGC ATTATCATCT TTTTTTATTG ACGAGATTTT TCCTGTACCA 540 

TCAATGTGTC CTGTAACGAC GTGACCCCCA AGTCGACCGT TGACAGATAA GGCTCTTTCT 600 

AGATTCACCT CACTTCCATG TTTTAATAGA GTAAGAGCTG TTCGACTCCA TGTTTCATTC 660 

ATTACATCAA CTGTAAAGGA TTGATGATTG AAATGAGTAA CTGTAAGACA GATACCATTT 720 

ACTGCTATAC TATCGCCTAA ATGGATATCC GTTAATATTT TTGAGGCTTT AATTGATAGT 780 

TTACAATTAC GAGAGTCTTT CTGTATTCCT TCAACTTTTC CGATTTCTTC AATTATTCCT 840 

GTGAACATGG ATAAATCACT TCACTTTCTA TGAGATAGTC NTTTCCTNTT TGAGAAAAAG 900 

CATAAGGTTT CAATCTAATA GCGTCATTTG GCAAAGAAAT GCCTTCACCT CCGACAGGAA 960 

ACTTGGCACT GCCTCCAAAA ATTTTTGGTG CAATATATAT TTTCAGCTCA TCAACAATTT 1020 

GTTGTTCCAA AGCACTCCAA TTCATTAGAC TGCCCCCCTC TACAACTAGG CTATCAATCT 1080 

GCATGTTTCC TTAGATGTTN GCATTAAACT CNGATAAGTC TATATGATTG CCTTTTTTCT 1140 

TTATGGAAAG TATTCCCC 1158 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



TAAAGGCTGT 


TTGGATAATG 


TAACATAAAG 


AAATCTTGTT 


TAGCAGGGAT 


TGAATAGAGT 


60 


TCCTGAAGTG 


ATGACTTCTT 


TATTAGACAT 


AATGTCTGGA 


CCCAAGATTA 


AAAGAATGGC 


120 


CCCGAAGAAT 


ACGAGCATCA 


AAGGTAGCAG 


ATGCAACAAC 


TGTATCGTGG 


AAGATTGAGA 


180 


GGAACTTAGG 


TAATTTAAGA 


TTGTCTAAAC 


TTTCTTCTTT 


CTTACCAAAG 


CGTCCTGCTA 


240 


CTTTATCTAC 


AAACCAGATT 


GCAAATTGCT 


GTTGGTGACC 


AATCGCAAAT 


CCGCCACCAC 


300 


CAGTCAAGCG 


TTGAGTTGCC 


TCAACAGTCA 


TATTTGAACT 


AACTGCCCAG 


TAAAGTCCAC 


360 


AGATGATACC 


AATCGCTGCT 


GTACCGTAAG 


CATTGCGCAA 


TTGTGGTACT 


AAGAATAGAA 


420 


CCATAAGAGA 


TACTGTTGCA 


GCTAGCCATG 


CTGATCAATT 


CCCTCAACCA 


GTCCTATATC 


480 


TCAGCCAAGA 


GAATCGAGGA 


GGTCTTTGCG 


GAATCTCCCG 


AAAACATCCA 


TTCAGAATTA 


540 


GAACAAAAGC 


AAGTTACCAG 


TGGTCGGGTT 


TTACAAGTCC 


AAGAATTGAC 


TTTTACCTAT 


600 


CCTGATGCGG 


CCCAGCCTTC 


TCTGAGAGAC 


ATTTCCTTTG 


ATATGACTCA 


AGGACAAATC 


660 


CTTGGTATCA 


TTGGGGGGAC 


TGGTTCTGGT 


AAATCAAGCT 


TGGTGCAACT 


CTTACTTGGA 


720 


CTTTATCCAG 


TAGACAAGGG 


GAACATTGAC 


CTTTATCAAA 


ATGGACGTAG 


TCCTCTTAAT 


780 


TTGGAGCAGT 


GGCGGTCTTG 


GATTGCCTAT 


GTACCTCAAA AGGTCAAACT 


CTTTAAGGGA 


840 


ACTATTCGTT 


CCAACTTGAC 


TTTAGGTTTA 


AATCAAGAAG 


TATCTGACCA 


GAAACTCTGG 


900 


CAGGCCTTGG 


AGATTGCGCA 


AGCTAAGGAT 


TTTGTCAGTG 


AAAAGGAAGG 


ACTTTTGGAT 


960 


GCCCTAATTG 


AAGCAGGGGG 


GCGAAATTTC 


TCAGGTGGAC 


AAAAACAAAG 


GTTGTCTATC 


1020 


GCCCGAGCAG 


TCTTGCGCCA 


AGCTCCGTTT 


ATCATCCTAG 


ATGATGCAAC 


CTCGGCACTG 


1080 


GATACCATTA 


CAGAGTCCAA 


GCTCTTGAAA 


GCTATTAGAG 


AAAATTTTCC 


AAACACGAGC 


1140 


TTAATTTTGA 


TCTCTCAACG 


AACCTCAACT 


TTACAGATGG 


CGGACCAGAT 


TCTCCTCTTG 


1200 


GAAAAAGGTG 


AGTTGCTAGC 


TGTTGGCAAG 


CACGATGACT 


TGATGAAATC 


CAGCCAAGTC 


1260 


TATCGTGAAA 


TCAATGCATC 


CCAACATGGA 


AAGGAGGACT 


AGAATGAAAC 


GACAAACTGT 


1320 


AAACCAGACG 


CTCAAACGTT 


TAGCCGTAGA 


TTTAGCAAAC 


CATCCCTTCC 


TCCTTTTCCT 


1380 


AGCCTTTCTA 


GGAACTATTG 


CCCAAGTTGG 


CTTATCAATT 


TACCTACCTA 


TTCTGATTGG 


1440 


GCAGGTCATT 


GACCAAGTCC 


TAGTGGCTGG 


TTCATCACCA 


GT 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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CTCTGAAGGA TCCAGAAAGT CGCTTATATA TTTTATTAAA AGACGGTCAG GTTATTGGGA 60 

CTTGTACGGT TGATTTATCG ACTAATACGA ATTACTTCTA CGGTTTAGCA ATATTGGAAC 120 

CTGAACGTGG AAAAGGCTAT GGAAGCTACT TAGCAAAATC CCTCGTCAAC CAACTAATTG 180 

AGCAAAATGA CAAGGAATTT CAGATTGCAG TGGAAGATAG CAATGTAGGT GCCAAACGTT 240 

TGTATGAAAA AATTGGCTTT GTCAAACAGA CTCAGGTGGT TTATCTGAAT GAGAAAGGAG 300 

CAAGGGATTC CGAAGTGTAG AGATATTCGG ACTGAAATTT ACTTGAACTT TTAGTGATGA 360 

AACTAATTGT TCTTGGATTT CAGCTTTCCT GATTATGATT TATGATTAAA ATCTATGACA 42C 

CCATGTCTCG TGATTTGCGA GAATTTGTCC CGATTGAGGA CGGCAAGATC AAGATGTATG 480 

TTTGTGGGCC AACGGTGTAC AACTATATCC ACGTGGGAAA CGCCCGTTCG ACGGTAGCTT 540 

TTTTGGATAC GAATTTCGTC GCTATTTTGA GTACCGTGGG TATAAGGTTG CCTATATTTC 600 

CAATTTTACA GATGTGGATG ATAAGATTAT CAACCGTGCC AGGGAAGAAG GCATCACGCC 660 

TCAGGAGGTT GCGGATAAGT ACATCGCTGC CTTTCGTGAG GATGTGACGG CCTTGGGCGT 720 

GAAACCTGCG ACTCGCCATC CGCGTGTAGT GGAGTTTATG GCAGACATCA TCCGTTTTGT 780 

GGAAGACTTG ATCGAGAAAG GCTTTGCCTA TGAGAGTCAA GGGGATGTCT ATTTCCGTGT 840 

AGAAAAATCC CACAACTATG CTAAATTGGC TAATAAAACC TTGGAAGATT TGGAG 895 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CTGTTTGCCC TTTCTTCTAA TAAAGAATTG GCAGAACGTG TGGCGCAGGA GATTGGGATA 60 
GAGTTGGGGA AATCAAGTGT TCGCCAATTT TCAGATGGAG AGATTCAGGT CAACATTGAA 120 
GAATCAATCC GTGGGAAACA CGTCTTTATC CTACAATCAA CTAGTTCGCC TGTAAATGAC 180 
AATCTGCTTG AAATTTTGAT TATGGTAGAT GCTTTGAAGC GTGCGAGTGC AGAATCTGTC 240 
AATGTTGTCA TGCCTTACTA TGGGTATGCA CGTCAGGATA GAAAGGCGAG AGCGCGTGAG 
CCAATCACTT CAAAACTTGT CGCAAATATG CTTGAAGTAG CTGGAGTGGA TCGTTTATTG 
ACCATCGACT TGCATGCTGC GCAAATTCAA GGATTCTTTG ATATTCCTGT GGATCATTTG 
ATGGGTGCTC CTCTGATTGC AGATTATTTT GAGCGTCCTG GTATGGTTGG TTCTGACTAT 
GTGGTTGTCA GCCCGGACCA TGGAGGGGTG ACTCGTGCTC GTAAGTTGGC AGAATTTTTG 
AAAACATCTA TCGCTATTAT TGAGAAACGT CGTAGCGTTG ATAAGATGAA TACTAGTGAA 
GTTATGAACA CCATCGGTAA GGTTGAAGGC AACCACTTGT AGCTTCGATT GATGATATGT 
ATTGATACCG CTGGAACGAT TTGTCATGCG GCAGATGCTC TTGCGGAAG 



300 
360 
420 
480 
540 
600 
660 
709 



(2) INFORMATION FOR SEQ ID NO: 28: 



fi) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



CTAGAAGAAC 


TCCGTCACAA 


GCTTGTGACT 


TGTAAGCAAG 


AACAGAGCAA 


GATTGAGGAT 


60 


TCCTTATGAT 


TTCATTCCTT 


CTTCTATTGG 


TCTTGGTTTG 


GGGATTTTAT 


ATCGGCTATC 


120 


GGAGAGGCCT 


GCTCTTACAG 


GTTTATTACC 


TGATTTCAGC 


CATGGCATCG 


GCTTTTATGG 


180 


CTGGCCAGTT 


TTATAAGGGG 


CTTGGAGAGC 


AATTCCATTT 


ATTGCTCCCT 


TATGCAAATT 


240 


CGCAGGAAGG 


TCAGGGGACT 


TTCTTTTTCC 


CATCGGATCA 


ACTCTTTCAG 


CTGGATAAGG 


300 


TCTTTTATGC 


AGGTATCGGC 


TACTTGCTTG 


TATTTGGGAT 


TGTCTATAGC 


ATTGGTCGTT 


360 


TGCTTGGTCT 


TCTCTTACAC 


TTGATTCCTA 


GCAAAAAACT 


GGGTGGTAAG 


TTGTTCCAAG 


420 


TTTCAGCAGG 


TATCTTGTCC 


ATGTTGGTGA 


CCTTATTTGT 


CTTGCAAATG 


GCCTTGACAA 


480 


TCTTGGCGAC 


CATCCCCATG 


GCAGTTATAC 


AAAATCCTCT 


TGAAAAGAGT 


ATCGTCGCAA 


540 


AACACATCAT 


CCAGAGCATA 


CCGATAACAA 


CCAGTTGGCT 


CAAACAAATC 


TGGGTGACAA 


60C 


ATTTAATCGG 


ATAAAAAGGG 


CAGGAGTTTT 


CCTAGCCCTT 


TGTTTACAGA 


TTTGACTCGA 


660 


ATCTATCAGA 


ATGTAAAAAG 


CTACCACACC 


TAGACATTCA 


AAGACAAGGA 


AATAAAGATG 


720 


AATAAGAAAA 


TATTAGAAAC 


ATTAGAGTTC 


GATAAGGTCA 


AGGCCTTGTT 


TGAGCCTCAT 


780 


TTGTTGACCG 


AGCAGGGCTT 


GGAGCAATTG 


AGACAG 






816 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CTTGAAGCAG 


CTGAAAGCAT 


GGGGTCTTCG 


GGATTTAAAA 


CCTATTCGTA 


ACGGGTTGTT 


60 


TTACCTTTCC 


TAGTTCCAAC 


CTTACTAGCA 


GCTCCTTGCT 


TGTATTTATG 


AGAGCATTCT 


120 


CAGACTTTGG 


AACGCCTATG 


TTGATTGGCG 


AAGGATATCG 


GACTTTCCCT 


GTCCTGATTT 


180 


ATACCCAATT 


TATTAGCGAG 


GTTGGAGGAA 


ATTCTGCTTT 


TGCAATTATG 


GCGATTATCA 


240 


TTGCCTTGGC 


AATTTTCCTT 


ATCCAAAAAC 


ACATTGCAAA 


CCGCTACAGT 


TTCAGCATGA 


300 


ATCTGCTCCA 


TCCAATTGAG 


CCTAAAAAAA 


CTACAAAAGG 


AAAAATGGCT 


GCCATTTATG 


360 


CAACAGTCTA 


CGGAATTATC 


TTTATCTCTG 


TTTTACCTCA 


AATCTACTTA 


ATTTATACCT 


420 


CTTTCCTAAA 


AACATCAGGT 


ATGGTATTTG 


TTAAAGGTTA 


TTCTCCAAAC 


AGTTACAAGG 


480 
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TAGCTTTCAA TCGTATGGGA TCTGCTATTT TCAATACCAT TCGTATCCCT TTGATTGCCT 540 

TAGTTCTAGT TGTTCCTATT TACGACATTT ATCTCCTACC TAGCCGTTAG AAAACGGAAT 600 

TTGTTTACAA ACTTAATTGA CAGCCTCAGT ATGGTACCTT ATATTGTACC AGGAACCGTT 660 

CTAGGGATTG CCTTCATTTC TTCCTTCAAT ACTGGTCTAT TTGGAAGTGG ATTTCTTATG 720 

ATTACAGGGA CTGCTTTCAT CTTGATTATG TCTCTATCTG TCAGAAGATT ACCGTATACT 780 

ATTCGCTCAT CTGTTGCTAG CTTACAACAA ATAGCACCAA GTATTGAAGA AGCTGCTGGA 840 

AAGCTTAGGA AGTAGTCGTC TCAATACCTT TGCTAAGATT ACAACTCCAA TGATGCTATC 900 

TGGTATCATT TCTGGAGCCA TCTTATCTTG GGTCACAATG ATTTCAAAAC TCTCTACTTC 960 

TATCCTCCTC TACAATGTCA AAACAAGAAC AATGACTGTA G 1001 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 30: 



60 



CTGGTTTGCA ATCTATTCCA ACTGAGA1TA CAGAGGCAGC AAGGATTGAT GGTGCGACTA 

GCAAGCAAGT TTTCTGGAAC ATTGAATTGC CTTACTTGCT ACCAAGTGTC TCTATGGTCT 120 

TTATCCTAGC CCTAAAAGGT GGGCTGACTG CCTTTGACCA AGTCTTTGCC ATGACCGGTG 180 

GTGGTCCAAA CAATGCCACA ACCTCACTTG GGCTCTTGGT TTATAACTAT GCCTTTAAAA 240 

ACAACCAATT CGGTTATGCC AATGCCATTG CCGTAATCTT GTTCCTCTTA ATTGTAGTGA 300 

TTTCGATCAT CCAATTGAGA GTATCTAAGA AATTTGAAAT TTAAGAGGAG AAGCATGATG 360 

AAACAAGATG AAAGAAAAGC CCTGATTGGC AAATACATTC TATTGATTCT AGGATCGGTT 420 

CTGATTTTAG TGCCGCTCCT TGCTACCCTC TTTAGTTCCT TTAAACCCAC TAAGGATATT 480 

GTAGATAATT TCTTTGGCTT TCCAACCAAC TTCACATGGG ACAACTTTAG CCGTCTCTTA 540 

GCTGATGGGA TTGGAGGCTA TTATTGGACT CTGTCGTCAT CACTGTCTTG TCTTTACTTG 600 

CAGTAATGAT CTTTATCCCT ATGGCAGCCT ACTCCATCGC TCGCAATATG AGTAAAAGAA 660* 

AAGCCTTTAC CATTCATGTA TACCCTCTTA ATCCTCGGAA TCTTCGTACC TTTCCAAGTC 720 

ATCATGATTC CGATTACGGT TATGATGAGT AAACTCGGTT TGGCTAATAC CTTTGGTTTG 78C 

ATCTTGCTCT ACTTGACCTA TGCGATTCCA CAGACCCTCT TTCTCTATGT TGGATATATC 840 

AAAATCTCGA TTCCAGAAAG TCTGGATGAA GCAGCAGAGA TCGATGGGGC TAATCAATTT 900 

ACAACCTATT TCCGCATCAT NTTCCCAATG ATGAAACCGA TCCATGCGAC AACCATGATC 960 

ATCAATGCCC TTTGGTTCTG GAATGACTTC ATGTTGCCAC TCCTTGTCTT GAACCGGGAT 1020 

TCCAAAATGT GGACTCTGCC TTTGTTCCAA TACAACTACG CAGG 1064 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CTGTAAAAGA 


AGTTAATCAA 


GTTGATAAAG 


CACTTTTAAA 


ACAAATTGGT 


GCAGTTGATG 


60 


TCTTAGAAGT 


GAAGGGTGGC 


ATTCAAGCAA 


TCTATGGAGC 


AAAAGCAATC 


TTATATAAAA 


120 


ATAGTATTAA 


TGAAATTTTA 


GGTGTAGATG 


ATTAAGTACT 


TACTGACTTA 


ATAAAAAACA 


180 


GAGGAGAGTG 


ATGGATGAGT AGGATGAAAT 


GAAATCGCAT 


ACAAGAAATA 


AAGAACTCAT 


240 


TATCCAAGTT 


GGATACGCTT 


ATTACATAGG 


AGAATACAAA 


TGAAATTTAG 


AAAATTAGCT 


300 


TGTACAGTAC 


TTGCGGGTGC 


TGCGGTTCTT 


GGTCTTGCTG 


CTTGTGGCAA 


TTCTGGCGGA 


360 


AGTAAAGATG 


CTGCCAAATC 


AGGTGGTGAC 


GGTGCCAAAA 


CAGAAATCAC 


TTGGTGGGCA 


420 


TTCCCAGTAT 


TTACCCAAGA 


AAAAACTGGT 


GACGGTGTTG 


GAACTTATGA 


AAAATCAATC 


480 


ATCCAAGCGT 


TTTGAAAAAG 


CAACCCAGAT 


ATAAAAGTGA 


AATTGGAAAC 


CATCCACTTC 


540 


CAGTCCAGTC 


CTGAAAAATC 


ACAACAGCCA 


TCCGAAGCAG 


GAACAG 




586 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1534 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



TGGTAGTTCT 


TCCTAGTGCA 


GGGGCTGTAG 


ACCCAGTTGC 


GACCCTAGCG 


CTGGACTAGT 


60 


CGAGAGGGTG 


TTGTTGAAAA 


TGGATGGNTA 


TCGCTATGTT 


GGTTATCTAT 


CAGGTGACAT 


120 


CCTCAAAACG 


CTTGGCTTGG 


ACACTGTTTT 


AGAAGAAACC 


TCAGCAAAAC 


CTGGAGAGGT 


180 


GACTGTAGTC 


GAAGTTGAGA 


CTCCTCAATC 


AACAACAAAT 


CAGGAGCAAG 


CTAGGACAGA 


240 


AAACCAAGTA 


GTAGAGACAG 


AGGAAGCTCC 


AAAAGAAGAA 


GCACCTAAAA 


CAGAAGAAAG 


300 


TCCAAAGGAA 


GAACCAAAAT 


CGGAGGTAAA 


ACCTACTGAC 


GACACCCTTC 


CTAAAGTAGA 


360 


AGAGGGGAAA 


GAAGATTCAG 


CAGAACCATC 


TCCAGTTGAA 


GAAGTAGGTG 


GAGAAGTTGA 


420 


GTCAAAACCA 


GAGGAAAAAG 


TAGCAGTTAA 


GCCAGAAAGT 


CAACCATCAG 


ACAAACCAGC 


480 


TGAGGAATCA 


AAAGTTGAAC 


CACCAGTAGA 


ACAAGCAAAA 


GTCCCAGAAC 


AACCCGTGCA 


540 


ACCTACACAA 


GCTGAGCAAC 


CAAGTACACC 


AAAAGAATCA 


TCACAACAAG 


AAAATCCTAA 


600 


AGAAGATAGG 


GGAGCGGAAG 


AGACACCGAA 


ACAAGAAGAT 


GAACAGCCAG 


CAGAAGCCCC 


660 


AAGAAATCAA 


GGTTGAAGAA 


CCAGTAGAAT 


CAAAAGAGGA 


GACTGTTAAT 


CAACCTGTTG 


720 
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AACAACCAAA AGTGGAAACG CCTGCTGTAG 
AAGTTGAAGT AACAAGTATT CCCCAAACTA 
ACGGAACGCG TGAAGTTGTT AAGGAAGGTA 
ATATCTTGAA TGCGACAGAT GGTACGACTA 
AGATGGAGAA AGAGGTTGTT CGTGTTGGCA 
TAAGTTTGAC AAGTGTTACA GATAATGCAA 
TGGAAAATAC AGATAGTGTT GATGTGAAAA 
AGGTTGTCAA AACTATTGAC TTATCTAAAG 
AACTTTATAA AGATTATAAG ATTGTGACGA 
AAACCTCTAC GTTGGAAGAA ACTCCACTAC 
ACATCGGCTC TACTAATCTC GTCAAAGTAA 
TCTTAACAAG TAAACCTGTG GATGTGCAGA 
ATAAAGTTGT TTCCCCTCCC AGTTGAAAAA 
CTTTACAAAG TCCCTGCTAA GGCCCTAATT 

(2) INFORMATION FOR SEQ 



PCT/US97/05306 



AAAAACAAAC 


GGAACCAACA 


GAGGAACCAA 


780 


CTCGCTATGA 


GGAAGACCTT 


ACTAAGGAAC 


840 


AGAATGGCAG 


TAGAACAGTT 


ACTACTCCAT 


900 


CAGAAGGCAC 


TTCGACAACT 


GATGAAGCTG 


960 


CGAAACCCAA 


AGAAAAATTA 


GCTCCAGTCT 


1020 


TGTTGCGTAG 


TGCGAGACTT 


ACTTATCATT 


1080 


AAATTCATGC TGAAATTAAA AATGGCGATA 


1140 


AGAGATTATC 


AGATGCTGTT 


GACGGTCTTG 


1200 


GTATGACCTA 


TGATAGAGGT 


AATGGTGAAG 


1260 


GATTAGACCT 


CAAGAAGGTT 


GAATTGAAAA 


1320 


ATGAGGATGG 


TACTGAGGTG 


GCAAGTGACT 


1380 


ATTACTACCT 


CAAAGTAACT 


TCCCGTGATA 


1440 


ATTGAAGAGG 


TGACTGAGGA AGGTCCACCA 


1500 


TGAT 






1534 



ID NO:33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLC :Y: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGGATTTGTT TATCTCAAGA AGAGTCGCGA TATTCTCCGT GAAAGTTCAG AATTGATTAN 
CCAAACGGTA GAAGATTATC TTCAAGGAGA TGACTTTGAC TGGGCAGATC TTAAAGGGAA 120 
GGTTCGANAT AATTTGACCA AGTATCTCTT TGACCAAACC AAGCGTCGTC CANCTATTTT 180 
ACCAGTAGTC ATGGAAGCAA AATAATCGTT GAAATAAACA GAGAGAAAGT CGAGTTTCGG 240 
CTTTTTCTTA TAGAAAAATA GAAGGAGAAA ATCATGGCAG TAATGAAAAT CGAGTATTAC 300 
TCACAAGTAT TGGATATGGA GTGGGGGGTG AATGTCCTCT ACCCTGATGC CAATCGAGTG 360 
GAAGAACCAG AGTGTGAAGA TATTCCCGTC TTGTACCTTT TGCACGGGAT GTCTGGAAAT 420 
CATAATAGTT GGCTTAAGCG GACCAATGTA GAACGCTTGC TTCGAGGAAC TAATCTCATC 480 
GTTGTTATGC CCAATACCAG CAATGGTTGG TACACCGACA CCCAGTATGG TTTTGACTAC 
TACACGGCTC TAGCAGAGGA ATTGCCACAG GTTCCTGAAA CGCTTCTTCC CTAATATGAC 
TAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG GGAGGCTACG GCTGCTTCAA 
ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAG 



60 



540 
600 
660 
695 



(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1096 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTCATCCAAO 


AA I AuL A i An 


AAoAAL 1 v. i i 


CTGTTTTTTC 


TTGAGGAACG AGAAAGTCAG 


60 


CAATCTCCAT 




Al 1- AAA Xa-LTX 


L I AAAOAvJAT 


» \3n X X X X X 


A AAfiT AttT A T 


120 


CACTAATTTG 


TTTCATTTTC 


ATTGCTAGTA 


ACCTCATACT 


x x uno i il in 


TPTATT21TAf* 


180 


TAGATTTTTA 


CGATTTTATC 


AAAAGAAGGC 


TCCTCTATAC 




X\_\~V^ i. xWjVso X 


240 


CTTTCTATAG 


GAGAC TCCAA 


GAGAAAATTT 


CTG C AGACAG 


AT Af? A & A & ar? 
AlnunnnAnu 


X X Lnnnn X 


300 


CGGCTCTTAG 


CCGATTTTGA 


AGACCTTATA 


CATCAGAATA 


CTTATAATTT 


AAAGGTTGCT 


360 


ACACCGAGGA 


TAGAACGATT 


TAAGTTTCTG 


AGAATTTGAA 


GACTTTGCTC 


AAATTTCTTA 


420 


TAACGAATTC 


ACTCCGTACT 


CTTCAACAAG 


AAGGACTGTA 


TCTCTTTCCA 


AAAGAGATGA 


480 


TACATCCTGC 


AAATCTACAA 


ATGCATTCCT 


TTTAAAGCTT 


CTTGACTCTG 


TTTCAATTTA 


540 


TCTAGGATAG 


CTTTATTTGA 


GCTAACGATG 


GTCAATTCCT 


GTCCAGTATT 


TTTGTATGAC 


600 


AAAACATCTG 


CTAGGTTAGC 


AATTGTTGTA 


ATCTCTGTTA 


CAAAATCAAT 


TTGATACTGA 


660 


GAAAAATCAC 


CTACTCTATT 


GATTGTTGGA 


TTAAAGAGAT 


AAACTAACAC 


ATTTCCCATC 


720 


ACAACCAAAA 


TCACACAAAC 


CACTCCAATA 


ACAACTAAAC 


GAAGAATCAG 


ATTTTTCACA 


780 


TTTAAGCCAA 


GCGCTGTTTC 


ACCATTTGCG 


TTCAATTCTT 


TAGAGTTGAT 


GGTTTCCAG1 


840 


TTTTCAATTT 


TCACATTTGC 


ATAAGCATGT 


TTAAATTTCT 


CAATCAACCC 


ATCAATTTx x 1 


900 


TTCTCTAACA 


AGTCATTGGC 


ATCTTTACTT 


GATGTCAAAA 


TTTTCACACC 


AACCCCTGCA 


960 


TCGTCAATCA 


TATAGTAGAC 


GGTCAATTTT 


TTCCACCAAT 


AGTCATTCGT 


TGAATTTTTC 


1020 


AAGGTTGTTT 


CCGTCGTGTC 


TAATTCACTG 


GCAATTTTTT 


TCAACTCACT 


GGGTTCTACA 


1080 


TCATTGAAAA 


GATAAG 










1096 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 60 

AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 120 

CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTGCCAAGCC TAGTAAACAG ATACTTGCTA 180 

ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 240 
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TGTTCCACCT AGCTTTTGCT ACTTACTGAT TATTTTACAA AGTCAAGCAA AGCCAAGAAG 300 

CTTTCAGCTT CAAGTGACGC ACCACCTACA AGGGCACCGT CAACGTCTGG GCAAGCCATG 3 60 

TATGAAGCAA CATTTTCAGG TTTAACAGAA CCACCGTATT GAACACGAAC TTTGTCTGCG 420 

ACTTCTTGAC CAAAGTCAGC AGCTACAACG TCACGAACAA CTTTACACAT TTTTTGTGCA 480 

TCGTCTTGTG AAGCTGATTT ACCAGTACCG ATAGCCCAGA TTGGCTCATA AGCGATAACT 540 

GAGGCAGCAA CTTGTTCAGC AGTCAATCCA GCCAATGCAG CAGATACTTG AGCACCTACG 600 

AATTCAGCAG CTTTACCAGC TTCGTAAGTT TCAAGTGATT CACCACAACA GATGATTGGA 660 

AGCATACCGT TCGCAAAGAT TGCTTTTGCT TTTTTGTTGA TATCTTCGTC AGTTTCATGG 720 

AAGTAGTCAC GGCGTTTCTG AGTGACCGAT AACAACGTAG TCAGTACCGA TTTCTTTCAA 780 

AACTTGTGGG CTAGTTTCAC CAGTGAAAGC ACCTGCATTT TCTCCGTAGC AGTTTTGAGC 840 

AGCGACTTTT AAGTTTGAGC CTTNGGCAAC AGCAAGNACA GTTGTCAAAT CAAGAGCTGG 900 

AGCAGNGATA CCTGCTTCAA CAAGATCTGA TGAAGGAAGT TTTGATGCAA CTGCTTCAAC 960 

GAATGNTCCA GCTTNTTGGG GATTTTTGTT CATTTTCCAG TTACCAGCGA TAAATGGTTT 1020 
ACGTGACATT TCACATA 

1037 

(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1340 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : single 
- (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:36: 

CTTATCGTAC TAAGGATGGC AGTGTTCAAC TGTTCCGTCC TGATGAAAAT GCTAAACGCC 
TGCAACGTAC ATGTGACCGT CTCTTGATGC CAACAAGTTC CGAACAGACA TGTTTGTAGA 
AGCTTGTAAA GCAGTTGTCC GTGCGAATGA AGAATACGTA CCACCATACG GAATAGGTGG i0M 
AACTTTATAT CTTCGCCCTC TTTTGATTGG TGTCGGAGAT ATTATCGGGG TAAAACCGGC 240 
AGAAGAGTAC ATTTTCACCA TCTTTGCTAT GCCAGTTGGA AATTACTTTA AAGGTGGTTT 300 
GGTCCCAACC AACTTCTTGA TTCAGGATGA GTACGACCGT GCAGCACCAA ATGGTACAGG 
TGCGGCTAAG GTTGGTGGAA ACTATGCTGC AAGTCTCTTA CCAGGAAAAA TGGCCAAGTC 
ACGCCATTTC TCAGATGTTA TCTATCTGGA CCCATCAACT CATACAAAGA TTGAAGAAGT 
CGGATCAGCT AATTTCTTTG GAATTACAGC TGATAATGAA TTTGTAACAC CATTGAGTCC 
ATCTATCTTG CCATCTATTA CCAAGTATTC CTTGCTTTAT TTGGCAGAAC ATCGCTTGGG 
ATTAACTCCT ATTGAGGGTG ATGTTCCAAT TGATAATCTT GACCGTTTTG TAAAGGCAGG 
TGCCTGTGGT ACAGCAGCGG TTATTTCTCC AATTGGAGGT ATTCAACATG GTGATGATTT 
CCATGTTATT CTATAGTGAA ACAGAAGTAG GTCCTGTGAC ACGTAAATTA TATAATGAAT 
TGACGGGTAT TCAGTTTGGC GATATTGAAG CGCCAGAAGG TTGGATTGTA AAAGTAGATT 
AAAATAAACC AAAGGAGATT TTTTATGAAA TAGAAAAAGT GGCTCTTAAC AGCAGGAGTG 
GTCCTGAGCA CGTCAGCTAT TTTAGTGGCT TGTGGAAAAA CTGATAAAGA AGCAGATGCA 
CCGACAACAT TTTCTTATGT CTATGCAGTA GATCCAGCAT CATTGGGCTA CAGTATAGCG 
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60 
120 
180 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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ACTCGAACAT CGAGGACAGA CGTTATTGGA AATGTTATTG ATGGTTTGAT GGAAAATGAT 1080 

AAATACGGCA ATGTTGCTCC TTCTCAAAAA GACTATGATT TGAACAGTAC AGGATGGGCT 1140 

CCAAGCTATC AAGATCCAGC GTCTTACTTG AATATTATGG ATCCAAAATC TGGTTCTGCC 1200 

ATGAAACACC TTGGCATTAC GAAAGGAAAA GATAAGGATG TTTGTAACTA AACCTGGTTT 1260 

GGATAAATAT AAGAAATTGT TAGAAGATGC TGTTTCTGAG ACCACTGACC TAGAGAAGAG 1320 

ATATGAAAAA TATGCCAAAG 1340 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CTCTGACGGC 


GTCGCCACTT 


AAGAAGAGTA 


TCAAAAAGAA 


AAATAGAAAA 


TTAACTAACA 


60 


AGGNAGAAAA 


ACACATGTCT 


AAAATTATCG 


GTATTGACTT 


AGGTACAACA 


AACTCAGCAG 


120 


TTGCAGTTCT 


TGAAGGAACT 


GAAAGCAAAA 


TCATCGCAAA 


CCCAGAAC *A 


AACCGCACAA 


180 


CTCCATCTGT 


AGTCTCATTC 


AAAAACGGAG 


AAATTATCGT 


TGGTGA'iJCT 


GCAAAACGTC 


240 


AAGCAGTCAC 


AAACCCAGAT 


ACAGTTATCT 


CTATCAAATC 


TAAGATGGGA 


ACTTCTGAAA 


300 


AAGTTTCTGC 


AAATGGAAAA 


GAATACACTC 


CACAAGAAAT 


CTCAGCTATG 


ATCCTTCAAT 


360 


ACTTGAAAGG 


CTACGCTGAA 


GACTACCTTG 


GTGAGAAAGT 


AACCAAAGCT 


GTTATCACAG 


420 


TTCCGGCTTA 


CTTCAACGAC 


GCTCAACGTC 


AAGCAACAAA 


AGACGCTGGT 


AAAATTGCTG 


480 


GTCTTGAAGT 


ANGACGTATT 


GTTAACGAAC 


CAACTGCAGC 


AGCTCTTGCT 


TATGGTTTGG 


540 


ACAAGACTGA 


CAAAGAAGAA 


AAAATCTTGG 


TATTTGACCT 


TGGTGGTGGT 


ACATTCGACG 


600 


TCTCTATCCT 


TGAATTGGGT 


GACGGTGTCT 


TCGACGTATT 


GTCAACTGCA 


GGGGACAACA 


660 


AACTTGGTGG 


TGACGACTTT 


GACCAAAAAA 


TCATTGACCA 


CTTGGTAGCA 


GAATTCAAGA 


720 


AAGAAAACGG 


TATTCGACTT 


GTCTACTGAC 


AAGATGGCAA 


TGCAACCTTT 


GAAAGATGCG 


780 


GCTGAAAAAG 


CCAAGAAAGA 


CCTTTCTGGT 


GTAACTTCAA 


CACAAATCAG 




830 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
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GTGGTGGTTT GACTGCCCTT ATCGAAGCAG GTTTCGAANT CTAAGACANA AGCAGGTTAC 
GCTCCAGAAT TGGCTTAACT TTGAAGTTCT TCACGGAAAT GAAATTGATC GTTCACTTGA 
TCTACGAAGG TGGATTCAAG AAAATGCGTC AATCTATTTC AAACACTGCT GAATACGGTG 

ACTATGTATC AGGTCCACGT GTAATCACTG AACAAGTTAA AGAAAATATG AAGGCTGTCT 240 

TGGCAGACAT CCAAAATGGT AAATTTGCAA ATGACTTTGT AAATCACTAT AAAGCTGGAC 300 

GTCCAAAATT GACTGCTTAC CGTGAACAAG CAGCTAACCT TGAAATTGAA AAAGTTGGTG 360 

CAGAATTGCG TAAAGCAATG CCATTCGTTG GTAAAAACGA CGATGATGCA TTCAAAATCT 420 

ATAACTAATT AGAAATATAT AGCGCTGGAG ATGATTTTAT GAAAAAGATT ATGAGAAAAA 480 

TTGCATCGTT ATTATTGGTT CTAGTTGTAT AATGTAATTA CACCGTCGGT AATAGTGCTA 540 

GCAGACCAAA ATAAAGCAGA TTGGTCGTAT GATGAAAATA CTGTAATTAA CATTTATGAT 600 

GATGCTAATT TTGAAGATGG TAGGTTGCAT ATGACCTTTG AACAATTCTT CAAATTGGCA 660 
CAAATAG 



60 
120 
180 



667 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CGGTTCCAGA NACTGGTATG GCTAGGAGTC GCGACAAAGT GTATCTTATG ATGGTCGATT 
TGATACTCAA TACGCTCAAT ACTTACGGAA AGAATAAACG ATCATTGTCA ACCGGAATAG 
TATGATAATC GTTTCTTGGT GGAATAAGTT CAATCTTATC CTTCTTAGGC ACAATAATGG 
AAGAGCCCAG TGTTCGATAA ACACGATTAT TAAGGCTGGC AATTTCCGTT AATTGCAAAG 
CTTCAATGGT AGGGTGTAAA ACAGCACCAC CAAGAGACTA GTTATAGGCA GTACTACCAG 
TCGGTGTCGA AACTGTTAGC CCGTCTCCAC GAAAACGTTC AAAGGGAACA CCATTTATTA 
CAATATCTGC CACCATGGTT CGATCAGACC TGCGGATGCT GGCTTCGTTG AGTGCTCTGA « v 
AAATCTTAAC TTCACCATTT TCAAGAAAGA CCTTCACATT CAGAACAGGG TAAGAAACCC 480 
TTGCCCCAGT ATCTAGCTGC AAATTAGTCA CTAGCTTGTC CAACTCAAAA TCACGATAAT 540 
CTGTATAGAA GCCCAAATGT CCAGTATGAA GACCGATAAA GCGGACCTTG TCAAGCTGAT 
TTTCGTACTT ATGAAAGGCC GACAAGAGCA TACCATCCCC GCCAATGGAA ATGACAATAT 
CCGGATTGGT ATCATTGAGT TATAAACTGA TTTCTCTTCA AACGATCTCG CAATTCATAC 
AAAACCCTTT GACTCTGCGG TTTTCTATTG GCTATCAGAT CAATTCGTTT ACCTGTATTC 
TTCATCTGTA TCGTCACTGN NTCCAACACC GTCATTTAAT TTTCTACTCA AAGGATCAAA 
AAGTGCCTGG GCTTCTTGGA TATCATCACG AATTTCACCC ATTTCTTCAT CCAACTGATG 
GGCGATTCTA GCTGTAATTT CCAGTCGCTT CTTAATCTCA TCTGGGAAAT CCCCTTGGTA 
CTTGTAGTTG AGAGAATGTT CTATCGTTGC CCAGAAATTC ATGGCCAAAG TACGAATTTG 

79 
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AATTTCTGCC AAAATAGTCT TAGCTCCATT 
ATGATAGGAA CGATAGCCTG ATGCTTTTCT 
TCGCATATCC TGACGCTTGT GCAAAATATC 
AACCATCACA CGTAAGCCAG CAATATCCTG 
GCCACGACGA GCCATTTTTT CTTTGATGCT 
CTCAATTGGA GAATGCTTAT TTTGCTTACG 
CTTTAACTCA CCAACAG 



PCT/US97/05306 

GATGGTATCA ACCGTATATT CTACTACCAC 1080 

ATGAGTAATG TAATCTCGCT CCTGTATGAT 1140 

CACTACTTCC TTGACGTCAT CTACAAACTG 1200 

CAAATCGTGT TCCAAGGTCG CATAAGTAAT 1260 

CTCAATTGGC TTGACTCGAC CGGTCACAAA 1320 

ATATTGCTTA CGAATACCAC GAAGTTTAAT 1380 

1397 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



CTCCCATGAC 


ACCTGCACGC 


TCCGTACGGG 


CCACGTGCTT 


TTTAATCCGT 


TCAACAACTT 


60 


CATAACCCGC 


TTCAACATCC 


ACACCCGACT 


GGGCAT/.TGC 


ATTTTTATTT 


GTCATTTGTT 


120 


TTTTCCTTTT 


CTTTAATGGA 


GAATCTGTCG 


CCTACTTGTA 


AAAACTGGTC 


TTTTCTTCCA 


180 


AACTTCTACG 


ATAGTCTTCT 


TCGTAGTCAT 


AGAGAGGCGT 


TGGGTAGTCA 


CCGTCAAAGT 


240 


AAGCGACACA 


GAGACCACCG 


TTCGGCGCAT 


CTGTTTCGAT 


ACCAATCGAC 


TCAATCAAGC 


300 


TATCAATTGA 


AAGATAAGTC 


AGACTGTCCG 


CACCAATGAT 


TTGGCGAGTT 


TCTTCGACCG 


360 


TATGATTGGC 


TGCAATCAGC 


TCCTGACGGG 


TCTGGATATC 


AATCCCGTAG 


AAACATGGAT 


420 


ACGCTAGTGC 


AGGACTTCCA 


ATGGCAACGT 


GAACCTCAGT 


CGCACCCGCT 


TCTTTCAAGA 


480 


GCTGAACGAT 


ACGACGAGAG 


GTTGTTCCAC 


GTACAATGGA 


ATCATCCACC 


ATGACCACAC 


540 


GTTTGCCTTT 


GACAACACCC 


GAAACAGCAG 


ACAGTTTCAT 


CCGCACTCCT 


TGCTCCCGCA 


600 


ATTCTTGAGT 


CGGTTGGATA 


AAAGTTCGCT 


GGGTGTATTG 


GTTTTTGATC 


AGACCCATTT 


660 


CATTTGGTAA 


GCCTGATTCT 


TCCGCAAATC 


CCATAGCCGC 


GCTTAGGGAA 


GAATTGGGCA 


720 


CACCAACTAC 


AATATCTGCC 


TCATGCTTAA 


ATTCTCGCGC 


CAATTGCGCT 


CCCATTCTCT 


780 


TACGTGCCGT 


ATGGACATTG 


ACCCGTGGAT 


ATTAGAATCA 


GGGC 




824 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTTCTTCCTT GGTACTTCCT CAATTGGATT ACCTTTGGCA TTGCTAGTTT TTCACTCCTA 
CCTTATATCC AAGGCACTAA AATTATGTTT TACCGAGCAG TATTGGCTCG AAAACGTCCA 
AAAGCTTGAA GGTTTTCCCC TCAAGCTTTT TTCTATCAAT GAATTTCTCC GCCTACCAAC 
GTGAGTTCCT GATCTCCATG TTCTATCATA GCGCCGATTG CTGCTTCTAT CCCTCGCCGA 
ATATCCACTA AACTCATAGT TGGAGTAGTC GGTCTGTTCA CCACCTGTTC CATCATATAA 
GGAATATGCA TAAAACCTGC CTTAACATAT GGGAATTTCT TTTCTACCAA ATAGAGAGCC 
TGATACATCA AATGACTGCA AACAAAAGTC CCTGCACGTA TTGGAAACAG ACTGCCGGTA 
AGTCCTTCTT TTTTATTAGC TTGTACCATC GCTTTGATAG GTAAACTACT AAAATAGGCC 
GATGCTCCAT CAATACGAAT CGGTGTATCA ATTGGTTGAT TGCCTTCGTT ATCAGGTATG 
CGAGCATCAT CTTGATTAAT AGCCACTCGT TCAGGTGTTA AGCCGGTCCT GCCGCCTGCT 
TGTCCAATAC AAAGTACAGC ATCTGGTTGA TATCGTAATA TTTCTGCCTC TAAAACTTCT 
GACGACTTAT AAAAAACCGT TGGAATTTCT ACCCAGCGAA CTTCAGCCCC ATTAATCTCA 
GATGGTAATA ATTTTACAGC CTCCAAAGCT GGATTAATCT TTTCACCTCC AAAAGGATTA 
AAACCTGTAA CCAATATTTT CATTTTATTT TCCTTTACTA AAATGCGAGA AAGTACATTA 
AGAATATGTG AATAACAATC ATTACTAGAG CAACACCTGC TTGAGCCTTT ATAACGCCAT 
TCTGATCTTT CATATCCATC AATGCTGCTG GTAGAGCGTT AAAATTAGCA GCCATTGGGG 
TCAATAAGGT CCCACAATAA CCTGCTGTCA TGGCAAGAGC ACCAGCCACA ATTGGATTAG 

(2) INFORMATION FOR SEQ ID NO: 42; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
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<ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTCGTAAAAA CACTAAAGGA GGTAAATAAC CAATGTTAGT ACCTAAACGT GTTAAACACC 
OTC^TT CCGTGGAAAA ATGCGCGGTG AAGCAAAAGG TGGAAAAGAA GTAGCATTCG 
GTGAATACGG TCTTCAAGCT ACAACTAGCC ACTGGATCAC TAACCGCCAA ATCGAAGCTG 
CTCGTATCGC CATGACTCGT TACATGAAAC GTGGTGGTAA AGTTTGGATT AAAATCTTCC 
CACACAAATC ATACACTGCT AAAGCTATCG GTGTGCGTAT GGGATCTGGT ^IZZ 

CTGAAGAGAT TGCACGTGAA GCGCTTCGAC TTGCTAGCCA CAAATTGCCA GTTTAAATGT 
AAATTCGTAA AACGTGAAGC AGAATAAGGA GAAGGCATGA AACTTAATGA 
TTTGTTAAAG AACTTCGTGG TCTTTCTCAA GAAGAACTCG CGAAGCGCGA 

»u— « Terrier tcg™** can***, 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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(2) INFORMATION FOR SEQ ID NO: 43: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



CTTCAGCGAC 


ATATCTATCC 


ACAACTTCTC 


TCGATTCATG 


TTCCTCTGAA 


AATGCCTGAA 


60 


ATTTTAATTG 


ACTAATTTGA 


TCCTGATACG 


AACTATCTGC 


TAACAAAACT 


TCAAGATGGG 


120 


AAACATTTGC 


TAACGGATAA 


GGTCTTCTAT 


CCTTACCTAA 


CCAAGTTTCT 


GTCTCTTCAT 


1B0 


CCTCTATTAG 


TCCCCAGTTA 


CTGGCAAAGT 


CAGGATGATT 


CTCTAAAAAA 


ATACGTTCTG 


z40 


TCTGAAAAGT 


GACTGACCGA 


ATGGGGAAAG 


AAGCTGTTCC 


TCTCTCAAAA 


CTAGTAAACA 


300 


ATGCACGCGC 


AATCCCCTGA 


CGGCGATGAC 


CTGGATGAAC 


CAGTATCGTC 


ACTTCTACAT 


360 


CTTGGTCATC 


TGCATAGACA 


GTTAATAAAC 


CAACAAGTTC 


GCCTTTTTCA 


TAATAAAGGA 


420 


AAAAGGCGGG 


CATGTTTGGA 


TCAAAATTAA 


GCATGTTAGA GAGATAGGGA 


TCGCGATAGG 


480 


TACCGTCATA 


GTTTTGGCAA 


CAGTTAATTA 


CTTTTTTCGC 


CTCAGATAGC 


TCTTCTTGGC 


540 


TTAACTTGTT 


TCTTGCTTGA 


ATC>?ATAGG 


TATCCTCTAC 


AAACCAGACG 


ATCTGTGACT 


600 


GGCATCTTTA 


GCCTGCTCGA 


GTTTATTGAC 


ATAATACTCC 


CGTTTTTCTT 


CGACTTCGTG 


660 


AATAACAGGC 


TCATCTTTCT 


TACCATGAAG 


ACGGACAATC 


TTGGCCGGAA 


TACCGACAAC 


720 


CGTCACATCA 


CTAGGTACAT 


CTGCTACGAC 


AACTGCTGCA 


GCACCGACCT 


TGGCATTTTC 


780 


ACCAATTTCC 


ACAGGCCCGA 


TAACTTGGGC 


ATGGGCTGAT 


ATGAGGGCTC 


CCTTTCGTAC 


840 


AGTCGGATGG 


CGTTTGCCAC 


AGTCTTTCCC 


TGTTCCCCCG 


AGAGTCACTC 


CGTGATAGAG 


900 


AAGAACGCCT 


TTTTCAACAA 


TCGCTGTCTC 


TCCAATCACC 


AGACCAGAAC 


CATGGTCAAT 


960 


AAAGACACCT 


GAATCAATCT 


GGGCTCCTGG 


ATGAATCTCA 


ATCTGAGTCC 


AAAAGCGCCA 


1020 


AAACTGACTG 


TGCATACGAG 


CTAAGAGTTT 


GAAGTCGTAC 


TTCCAGAGAA 


AATGCGAGAG 


1080 


ACGGTGGGCC 


GCCAAGGCCT 


TGACACCTGG 


ATAAGTCAGC 


AAAACCTCCA 


AAGTGGTGCG 


1140 


GGCCGCTGGA 


TCATTTTCTT 


TTACGATATC 


AATGGTTTCG 


CGCCACCACC 


CCATACATTT 


1200 


CTCCTTTTCT 


TACTCTGAAT 


CTT 








1223 



(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 845 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
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60 



GGTGAGTGCC TTTATTAGCG GAGCTTCTCA AGTGATTGTT GCCTTGATTA TCTTCCTTTC 

ATGCTCTTTT ATCTCTTGCG TGATGGGAAA GGCTTGCGTA ACTATTTGAC CCAATTCATT 120 

CCAAGCAAAT TGAAGGAACC TGTTGGACAA GTTCTATCAG ATGTGAATCA ACAGTTGTCC 180 

AACTATGTTC GAGGGCAAGT GACAGTGGCT ATTATTGTAG CAGTAATGTT TATCATCTTC 240 

TTCAAGATTA TTGGTCTACG CTATGCGGTT ACGCTGGGGG TTACTGCTGG TATTTTAAAT 300 

CTGGTCCCTT ATCTTGGTAG CTTTCTAGCC ATGCTTCCTG CCCTAGTATT GGGTTTGATT 360 
GCTGGTCCAG TCATGCTTTT GAAAGTAGTG ATTGTCTTTA TTGTAGAACA AACTATTGAA 
GGCCGTTTTG TCTCTCCATT GATTTTGGGA AGTCAATTAA ACATCCACCC TATTAATGTT 
CTCTTTGTTT TGTTAACTTC AGGATCTATG TTTGGTATCT GGGGAGTTTT ACTTGGTATT 

CCGGTTTATG CCTCTGCTAA GGTTGTCATT TCAGCCATTT TCGAATGGTA TAAGGTAGTC 600 

AGTGGTCTAT ATGAATTAGA GGGTGAGGAA GTCAAGAGTG AACAATAGTC AACAGATGTT 660 

ACAGGCTTTG GAGGAGCAAG ATTTAACTAA GGCTGAGCAT TATTTCGCCA AAGCTTTAGA 720 

AAATGATTCA AGTGATCTTC TGTATGAGTT GGCAACTTAT CTTGAAGGGA TTGGTTTCTA 780 

TCCTCAGGCC AAGGAAATTT ACCTGAAAAT TGTAGAAGAT TTTCCAGAGG TTCATCTTAA 840 
TCTAG 



420 
480 
540 



845 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 802 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CTATGGATTA ATCTTCTTCT ATCTTATCCT GTTCCTAAAT ATGTTGGTAG AAAAATGAAG 60 

CAGTTGTTAC GAGTGTCTGT ATTGGAAAAA AATTTTGTCT ATACAATTCT TTAAATGATA 120 

TGAAGACTAG GAATGCCAAA ACACTGACAT ATTTCGTTTA AAGCCTATCC AGTCTCTTCA 180 

ATTGTATTTG TAGAAATGCT ATCAGTAATA TATTCGCGAA CACTATGATT TACAACAGTG 240 

TGGTCGTACA TCTTTGTAAC TTTATTATAC AACAACCATT TACAGGTGTA AAGTTGAGCA 300 

GTGTCTTTAG TAAATTTAGG GACTTGTTTT GTCATAGTTT TGATACCTGT ATTCGTTGTT 360 

TCATGAGTAT TTGACGTTTC TTTTCATGTA GGCATTTCAA CAATTAGAAT GATATTTTTC 420 

CGAGGATGAA CTTATATATT ATGACTTTTT AGAGTTATGA TGCCTCTTGA TTTGACTATC 480 

GCTATCAATT TCATCCACTT TTACAGTTTC ATGCAACATA GCGAGAAGGC GGTTTCCTAT 540 

ATGGAAGTCG TCTTGTTTTC ATATTGTCTA AATACGAGAT ATTTACTACA GGTTGTAGAT 600 

ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGAGACAA GTGAAATAAT GCCTGGTGAG 660 

TTAGATGAAA TAAAAAGTAA GGAGTTGAAA GTACTGGTTC NTTGTGCAGG GTCTGGAACA 720 

AGTGCGCAAT TAGCCAATGC AATTAACGAG GGCGCTCNAT TAGCAGAAGT TAGAGTGATT 780 
GCGAATTCAG GAGCGCACGG AG 

802 
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(2) INFORMATION FOR SEQ ID NO: 46: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



CTGGGACTCT 


CTTCGTAGAA 


GTCATGCAAG 


AATATTTTGA 


TCAAAAGAGG 


AAATCATGAA 


60 


AAAAAGAGCA 


ATAGTGGCAG 


TCATTGTACT 


GCTTTTAATT 


GGGCTGGATC 


AGTTGGTCAA 


120 


ATCCTATATC 


GTCCAGCAGA 


TTCCACTGGG 


TGAAGTGCGC 


TCCTGGATTC 


CCAATTTCGT 


180 


TAGCTTGACC 


TACCTGCAAA 


ATCGAGGTGC 


AGCCTTTTCT 


ATCTTACAAG 


ATCAGCAGCT 


240 


GTTATTCGCT 


GTCATTACTC 


TGGTTGTCGT 


GATAGGTGCC 


ATTTGGTATT 


TACATAAACA 


300 


CATGGAGGAC 


TCATTCTGGA 


TGGTCTTGGG 


TTTGACTCTA 


ATAATCGCGG 


GTGGTCTTGG 


360 


AAACTTTATT 


GACAGGGTCA 


GTCAGGGCTT 


TGTTGTGGAT 


ATGTTCCATC 


TTGACTTTAT 


420 


CAACTTTGCA 


ATTTTCAATG 


TGGCAGATAA 


CTATCTGACG 


GTTGGAGTGA 


TTATTTTATT 


480 


GATTGCAATG 


CTV AAGAGG 


AAATAAATGG 


AAATTAAAAT 


TGAAACTGGT 


GGTCTGCGTT 


540 


TGGATAAGGC 


TTTGTCAGAT 


TTGTCAGAAT 


TATCACGTAG 


TCTCGCGAAT 


GAACAAATTA 


600 


AATCAGGCCA 


GGTCTTGGTC 


AATGGTCAAG 


TCAAGAAAGC 


TAAATACACA 


GTCCAAGAGG 


660 


GTGATGTCGT 


CACTTACCAT 


GTGCCANAAC 


CAGAGGTATT 


AGAGTATGTG 


GCTGAGGATC 


720 


TTCCGCTAGA 


AATAGTCTAC 


CAAGATGAAG 


ATGTGGCTGT 


CCTTAACAAA 


CCTCAGGGAA 


780 


TGGTTGTGCA 


CCCGAGTGCT 


GGTCATACCA 


GTGGAACCCT 


AAGTAAATGC 


CCCCCATGTA 


840 


TCATATTAAG 


GACTTGTCGG 


GTATCAATGG 


GGTTCTGCCT 


CCANGGATTG 


TTCACCCGTA 


900 


TTGATAAGGA 


TACGTCACGT 


CTTCTCAT 








928 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCTTTTACA GAGGTGTTTC CTATTTTGTG CTTCCATTTT TGGGGCAAGG AATTGTAGAT 60 
GGGGATGGCA ATCCTATCTT TTTATTGATT ATGATATTCG TTTGCTTCAT AGTTTTAGTC 120 
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TTTTTGAAAT GGTTAGACTA TGATTTCACT AGATTGAGAA GGGAGTTTCT AGATACAGGT 180 

TTTCAAAAGT CTCTTACTAA GATTAACTGG GCAATGGGGG CTTATTATCT AGTGATGCAA 240 

AGTCTATCTT ACCTTGAATA TGAACAAGGT ATTCAATCAA CGACTGTTCG CCATCTCATC 300 

CTAGTGTTTT ACCTACTCTT TTTTATGGGG GGTATCAAGA AATTGGATAC CTATTTGAAG 360 

GAAAAACTTC AGGAGGAACT GAACCAAGAG CAGACCTTGC GCTACAGAGA TATGGAACGC 420 

TATAGTCGGC ATATAGAGGA ACTTTACAAG GAAATTCGGA GTTTTCGCCA TGACTACACT 480 

AACCTCTTAA CCACTTACGT TTGGGCATTG AAGAGAAGGA TATGGAGCAG ATAAAAGAGA 540 

TGTACGACTC GGTCTTAAGG GATTCCAGTC AGAAATTGCA GGACAATAAA TATGACCTGG 600 

GCAGATTGGT GAATATTNGT GACCGTGCCC TCAAGAGTCT CCTAGCTGGA AAATTTATAA 660 



AAG 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

85 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

CTAGATTAAT CTGATATTTC TTTTTGGCAA ACCAAATGCT TCCTATCAGA AAAGCTAGCA 60 
ACAGCAACAT GGTAATAATG ATATGAATGG TCATTTTCTT CTCCTATTCT GCCTTTTCAA 120 
TATCTTTTTT CATCTCGTCA ACATTGAACT TAGCAAACAA GTATTGACGG TCTTGGACTG 180 
GGAAACGTTG GTCCAACTGG TCAACTGCTC CCACCTCGAT AATGCCTTCC TTGATGACAA 240 
AATCCATGAC ATGTCCACCG AAGGTCAAAT CATCTGATAT GAAGTGCAGA TGGTAGCCTG 300 
CCACACTGAC CCCATGGAAA ATCTCAGGCG TCCAGAAACC AACAATAGTC CCCGCAACAT 360 
TGTCACGACT ATATTCCGGT TGATGGGTTG CGACATCAGC AAACTTGGTA TCGGGTGTTG 
ACTTAGGAAT CATACGCACA TGCATATGTT AAAAATTCCC CCCCGAATCT TGATAGAGCG 
GAAAAGATTT TCCCCATCAT AATAAGACTC AATTCGTTCT TTCAATTCCT TGTCTGTCAT 
CTCAAAGCGC TGGCGAAAAA TGACCTCTGC CTGATGCGGT ACCACTGCAG CGTAAGGAAT 
AAGGGCATCT GGTGACACTT CCACAATTTC TGGCTGGTCT CCTGACCCTT TGG 
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SEQUENCE DESCRIPTION: 


SEQ ID NO: 


49: 






PT APC2 AT ATP 


TTTGGTCTTT 


TGTAAGATAT 


GAGGTCCACC 


CTTATGCGCC 


TCAGTTGGCA 


60 


TTTPATGCGA 


TTCAAGAAGT 


TGCCCCTCTT 


GATCAACCAA ACCATACTTG ATGTTGGTTC 


120 


CACCGATATP 


AATTGCAACG 


TAATATGTCA 


TAAATACCTC 


CTTTTAGATT 


AGAGGAAGCG 


180 


CTPPTTGGTT 


TCACGAATCA 


AGGCAGCAGC 


CGCTTCTACA 


ACTGGACGAT 


CTTCTTCAGT 


240 


CACTGGTGTC 


AATGGTGAAC 


GAACAGATCC 


AATATTCAAG 


CCTTCATTGA 


TTTTCAAGAC 


300 


TTPTTTG A TG 


ACACCGTACA 


TATTTCCATG 


AGCAGAAGTG 


AGTTTACCAA 


TGATTGCGTT 


360 


G AT ACT 1 AT A P 


TGCAATTCAC 


GCGCTGTTTC 


TAGGTCCTTA 


TCCGCAATCA ACTGATTGAG 


420 


TTTPAAGAAG 


AGTTCTGGCA 


TAGCACCATA 


APT APPAPPP 


ATACCAGCCC 


TAGCCCCCAT 


480 


f^APGPGTPPT 


CCTAGGAACT 


GCTCATCAGG 


ACCATTAAAG 


ACGATATGGT 


CTTCTCCACC 


540 


A APPPTP APA 


AAGGTTTGGA 


TATCTTGAAC 


TGGCATAGAA 


GAGTTCTTCA 


CACCGATAAC 


600 


APGAGGATTT 


TTCAACATTT 


CTGTGTAAAG 


GCTTGGAGTC 


AAAGCAACCC 


CTGCCAATTG 


660 


AGGAATCjTTG 


TAAATCACGT 


AGTCTGTGTT 


TGGAGCTGCA 


GAACTGATAT 


CGTTCCAGTA 


720 


TTTGGCAACT 


GAGTTATTCT 


GGCAAGCGGA 


AATAAATTGG 


TGGAATCCGT 


TGCAATAGCA 


780 


TCTACTCCCA 


AGCTTTCAGC 


ATGGCGAGCA 


AGTTCCATAC 


TATCTTTAGT 


ATTATTGCAA 


840 


GCAACATGGG 


CAATAATGGT 


CAATTTACCT 


TTGGCTACCG 


CCATGACTTC 


TTCCAAAATC 


900 


AACTTGCGAT 


CTTCAACGCT 


TTGGTAGATA 


CATTCACCAG 


AAGAACCATT 


GACATAAGAC 


960 


CTTGAACACC 


TTTATCAATG 


AAGTATTGA 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



TCAAAGAGTA 


ACAAAGGCAC 


CAAATTCTCG 


ATAGGAACGA 


TTTAGCACGG 


TAAACTTCAT 


60 


CCACTTGGGT 


TCACGGAACC 


AAACCAGCAA 


TAATTTCTTT 


GGGCACGGGT 


TAATAGCATT 


120 


TTGGTCAACT 


AGGAGTAGAT 


AGAACACATT 


TCNTTCTTCG 


TCTATATCAA 


TCTTAACACC 


180 


TGTTTCAGCG 


ATAATCTTGT 


CGATGGTTTC 


TCCACCCTTA 


CCGATGACAA 


TCTTAATCTT 


240 


GTCCACATCA 


ATCTTGATCG 


TATCAATTTT 


CGGAGCAGTT 


GGAGCCAATT 


CTGGACGAAC 


300 


TTCTGGAATG 


GTTGCTTCAA 


TGACATCAAG 


GATTTCAAAA 


CGCGCTTTCT 


TGGCTTGAGC 


360 


AAGAGCCTCC 


GTCAAGATTT 


CTGCAGTAAT 


CCCTTGAATC 


TTGATATCCA 


TTTGAAGGGC 


420 


TGTAATCCCA 


TCACGAGTAC 


CTGCAACCTT 


GAAGTCCATA 


TCTCCAAAGT 


GATCTTCCAA 


480 


ACCTTGGATA 


TCTGTCAATA 


CTGTGTAGTT 


ATTTCCATCT 


GAGATAAGTC 


CCATAGCAAT 


540 


ACCAGCTACT 


GGCGCCTTGA 


TTGGCACACC 


ACCAGCCATA 


AGGGCAAGAG 


TTCCCGCACA 


600 


GATAGAAGCT 


TGAGATGAAG 


AACCGTTTGA 


TTCCAAAACT 


TCTGCTACTA 


GACGGATAGC 


660 
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GTATGGGAAT TCTTCCAAGC TTGGCAAGAC TTGAGCAAGA GCACGCTCAC CAAGGGCACC 720 
GTGACCGATT TCACGACGAC CTGGCGCACC GTAACGACCT GTTTCCCCTA CAGAATATTG 780 
AGGGAAGTTA TAGTGGTGCA TAAAGCGTTT CTTGTACTCT GGATCCAAAC CATCAATGAT 
TTGAGTTTCT CCCATCGGAG CCAAGGTCAA GACTGAAAGA GCTTGAGTTT GCCCACGAGT 
AAAGAGACCT GAACCATGTT ACACGAGGAA GGAAGTCAAC AACCGCATCC AAAGGACGGA 
TTTCATCGAC CTTACGACCA TCAGGACGCA CCTTGTCTTC TGTAATTAAA CGTCGCACTT 
CTGCGTGTTC CATTTGTTCC AAGATTTCAG CCACATCACG CATAATACGG TCAAATTCTT 
CGTGGTCCGC ATATTTTTCT TCGTAAACGG CAGTCACTTG GTCTTTCACT GCTTGAGTTG 
CAGTTCACGG GAATCCAATT TCTCTTCTAC TTGAACTGCC TTTTGGAGGT CACTGTTGTA 
GGCTGCAATG ATTTCAGCTT GCAATTCAGC ATCCACGTGA AGCAATTCCA CTTCTGCTTT 
TTCTTTACCG ACAGCAGCAA CGATTTCTTC TTGGAAGGCA ATCAATTCTT TGACAGCTTC 
GTGCCCTTTA AGAAGCGCTT CCAACATGAT TTCTTCTGAC AATTCTTTGG CACCAGACTC 
TACCATGTTG ATAGCGTGCT TGGTTCCAGC TACTGTCAAT TCAAGAAGAG ATTGCTCTGC 
TTGTTCTTGA CTTGGGTTGA TGATGATTTG GCCATCTACA TATCCCACTT GTACCCCAGC 
AATTGGTCCG TCAAATGGAA TATCTGAAAT AGACAGTGCC AAAGATGAAC CAAACATAGC 
AGCCATTGGT GCAGATGCAT TTTCATCATA AGAAAGCACT GTATTGATGA CTTGGACTTC 
ATTACGGAAA CCTTCCGCAA ACATAGGACG AATCGGACGG TCAATCAAAC G 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



840 
900 
960 
1020 
1080 
1U0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1671 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTCAGTATCT TTGTAATCAA CATATCCAAT TTTGTTTGCT GCGATGTANT CAACTTTTTT 
ACGGCGTTTG AATCCGCCAC GACGTNGTTG AGCCATGTTT TTNCTCCTTT A^G^Ic 
TTGTCCATTA GAATGGTAAA TCATCANCTG AAATATCCAA TGGGTTTGTT ZkcZk 
GATTT^ATT ACGTGAAAAG TCTGGTACTG AATTTGTAGG TGCTGAATAG ^C^O 
GTGCAGAGTA AGCTCCACCT GTGTGACCCT CACGCACACT ACGGCTTTCC AACATTTGGA 
AATTCTCAGC CACGACCTCT GTCACGTAGA CACG^C TTGCTGGTTA ^ 
GAGTCTGGAT ACGACCTGTC ACCCCGATAA GTGAGCCTTT TTTAGCCCAG TTAGCAAGAT 
TTTCAGCC^ TTGGCGCCAC ATAACGACAT TAATAAAATC AGCCTCAc" ££££ 
GACTCTTAAA TGTACGGTTT ACTGCAAGAG TAAAAGTCGC AACTGCTACA TTTGATGGGG 

™ C TACAACATTG 
ZlTJI^l TACGCGTCAA TTTTGACGAT CATGTGACGA AGAATGTCAG CGTTGATTT" 
TGGTAAAGTC ^"^ U ^" T ^"* > ^^AGAGCTG CATCGNTCAT TTGCTTCAAC GTTAACGATG 720 

731 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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(2) INFORMATION FOR SEQ ID NO: 52: 



PCT/US97/05306 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



CTGGGTTTTA 


CCACGTTTTC 


CTGATGTACG 


AGCGAGATCC 


TTACGGTTCA 


ACATAGTGTT 


60 


GATAAAGCTG 


GATTTACCAA 


CATTTGAACG 


CCCTGCTAGG 


GCAATCTCTG 


GCAGTTCATC 


120 


CTGCGGATAG 


TGGGACTTAT 


TAGCTGCACT 


GAGCAAGATT 


TCAGCATTGT 


GTGTATTAAG 


180 


TTCCATAGTC 


ACCTCTAGGC 


TGTTTCTAGG 


ATCGGTTTAT 


CCGTTCCATC 


GACAGTTTCT 


240 


TTAGTGATGC 


GAACCAATTT 


CACATTTTCC 


TGACTCGGCA 


CCTCAAACAT 


AACATCTAGC 


300 


ATGGTTTCTT 


CGATGATGGA 


GCGAAGTCCA 


CGCGCCCCTG 


TCTTCCGTTC 


GATTGCTTTA 


360 


TTAGCAATCT 


CTTGAAGGGC 


TTCGTCGTCA 


AATTCCAACT 


CAACATCATC 


ATAAGAAAGC 


420 


AAGGTTTGGT 


ATTGTTTCAC 


CAAGGCATTT 


CTTGGCTCTT 


TCAAGATGCG 


AACCAAGTCC 


480 


ATCAACGGAC 


AATTGCTCAA 


GAGCCGCAAA 


AACAGGCAAG 


CGTCCAATCA 


ACTCAGGGAT 


540 


AATACCAAAT 


TTTTGAATGT 


CTTCAGCGAT 


GAATTCTTGC 


ATGTATGAGC 


TGTTTCTCTG 


600 


TCAATCGCCT 


TATTGTTTTG 


ACCAAATCCG 


ATGACTTTTT 


CACCCAGACG 


TTGTTTGACA 


660 


ATTTCTTCAA 


TACCATCAAA 


AGCACCACCC 


ACGATGAAGA 


GGATATTTTT 


TGTATCCACT 


720 


TGAATCATCT 


ACTTGTTGTG 


GATGTTTGCG 


TCCACCTTGA 


GGCGGCACGC 


TAGCAACAGT 


780 


TCCCTCAATA 


ATCTTGAGAA 


GGGCTTGTTG 


CACCCCTTCA 


CCAGAAACAT 


CACGTGTGAT 


840 


AGACACATTC 


TCACTCTTCT 


TGGCAATCTT 


GTCAATTTCA 


TCCACATAGA 


TAATGCCACG 


900 


CTCTGCACGT 


TCGATGTTAA 


AGTCAGCAAC 


CTGCAAGAGT 


TTGAGGAGTA 


TATTTTCCAC 


960 


ATCCTCACCC 


ACATAACCAG 


CCTCCGTCAG 


AG 






992 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATTTTCACAG GGCAGGATGC GGAACGACTA GCCCCATACT TTAACGGAAT TTTGGTAGGG 60 
ACAGCTCTTA TGCAGGCAGA AAATGTGGCC CAGAGAATCA AGGAGTTGCA GATTGACAAA 120 

88 
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GGTTAAAATT TGTGGACTAT CGACCAAAGA AGCGGTGGAA ACAGCCGTTT CAGCAGGAGC 
CGACTATATC GGTTTTGTCT TTGCACCTAG TAAAAGACAG GTGACTTTAG AAGAGGCAGC 
TGAGTTGGCA AAGCTTATTC CTGCAGATGT GAAAAAGGTT GGAGTATTTG TTTCACCAAG 
TCGGGTAGAA CTGCTGGAAG CGATTGACAA AGTTGGCTTG GACTTGGTTC AAGTTCACGG 
TCA^TGOCA GATGATTTAT TTGAGAATTT GCCT^TGCC AGTATTCAGG ClS^ 
AGATGGAAAT GGGCATGTCC CCAATTCTCA GGCAGATTAT CTACTCTTTG AT«SS 

CTTCTTTATC GCAGGTGGCC TTAATGAAGA TAATGTAGTA AAAGCAATTC AACATTTTAC 
TCCCTATGCA GTAGATGTAT CGAGCGGAGT GGAGACAGAT GGACAAAAAG A^CA^ 
GATTAGAAGA TTTATAGAGA GGGTAAAGCA TGGCATATCA GGAACCAAAT 
TTTACGGAAA ATTCGGCGGA COmiWCC CAGAAACATT GATGACAGCA GTTTTGGAGT 
TGGAGAAGGC CTACCGTGAA AGTCAGGCAG ACCCAAGTTT CCAAGAGGAA 

sss crc ™ AcTA cgcaaaaa ac = ™ 

ATATCGGCGG AGCCAAGATT TATCTCAAAC GGGAAGACCT TAACCATACA GGAGCCC Ar a 
AGATTAACAA TGCCTTAGGA CAAG^GC TTGCCAAACG CATGGGTAAA Z^Z A 

ZT^r CAGCACGGTC TGGCAACTGC AAC^ ££££ 

ACATGGAATG TACCATCTAC ATGGGTGAGG AAGATGTCAA ACGCCAAGCC CTCAATCTCT 
TCCGTATGGA GCTTTTGGGA GCTAAGGTTG AAGCAGTGAC AGATGGTTCG S^A 1200 
AGGATGCGGT CAATGCAGCC CTTCGTTCAT GGGTGGCTAA TATCGACGAT A^CCACT^A 0 
TCCTTGGTTC TGCCTTGGGG CCTCATCCAT TTCCAGAAA ACCCACTATA 12 60 



(2) INFORMATION FOR SEQ IT. JJO: 54 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

CTGTTGAATC AACAACACGT ATCCAAGTAT CAGNATCAAA TTATGCAGGT Aat™. 
TTGGAAATCA CCGTGGATGG TTCAATCCAA CAAr-A,™ TTATGCAGGT AATCGTACAA 
TTTATGCAGA TTAATTTACA SSSSJ SET" ""^ 

ACAATCCCTA TTAAAAATTA TATCAAAATA £££££ ATTGGAAAAG tT^ 
TGAAAATTGT CGTGTGAACG ATAATACTCA TTCTTGATGA TATGGTAGAA 

««m gcgag^aa g^aagaaS ££££ SSSS aa^ 7 

GCAGTAATTT CAATGAAACA ACTTCTTGAG GCTGGTGTAC A^SS C^L 
CGCTGGAATC ctaaga^gc TAAGTACATC TTTACTGAAC ZT 
GACTTGCAAC AAACTGTAAA ATACGCTGAC CAAGCATAC^ Tr^lT CCAC ^TATC 

gctaacgatg ca^gtatt g^tcg^gt SSS SSS A^r AGCA 

TGAAGAAGCA GTACGTTCAG G„. CATCAACCAC C^S ££££ 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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TACAAACTGG 


GGAACAATCC 


AAAAACGTAT 


CGCTCGTTTG 


AAAGAAATTA 


AACGTGGATG 


720 


GAAGAAGATG 


GAACTTTCGA 


AGTTCTTCCT 


AAGAAAGAAG 


TTGCACTTCT 


TAACAAACAA 


780 


CGTGCGCGTC 


TTGAAAAATT 


CTTGGGCGGT 


ATCCGAAGAT 


ATGCCTCGTA 


CCCCAGATGT 


840 


GATGTACGTA 


GTTGACCCAC 


ATAAAGAGCA 


AATCGCATGT 


TAAAGAAGCT 


AAAAAATTGG 


900 


GAATCCCAGT 


TGTAGCGATG 


GTTGACACCA 


ATACTGATCC 


AGATGATATC 


GATGTAATCA 


960 


TCCCAGCTAA 


CGATGACGCT 


ATCCGTGCTG 


TTAAATTGAT 


CACAGCTAAA 


TTGGCTGACG 


1020 


CTATTATCGA AGGACGTCAA 


GGTGAGGATG 


CAGTAGCCGT 


TTGAAGCAGA 


ATTTGCAG 


1078 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



CTATGAGGGA 


TATAAAGTGA 


GAGAGTTGGG 


CCAACGTTCC 


AAAAACGTCC 


TAAATTCTGC 


60 


CCATTGACAA 


AGGCAACCCC 


CTTACCAAAC 


TCAGACNAGT 


CTAGGTAAGT 


ATCTTTTGGC 


120 


TATTCGACTG 


TAAAGTCATA 


AGCGTAAAAG 


GCTGGTTGTC 


CTTGAGTCCA 


TCCTTTGGAA 


180 


AAATCAATTT 


TCTCAGGATT 


GTCTAGTGGG 


AGTGGATAGT 


GTTTCCAGTT 


TAGTAAGAAA 


240 


TGCAGATCCT 


TACAGACCCC 


TGTCCGAATT 


CCCTTACGTT 


GCGTATCCGC 


TAAGAACTTA 


300 


TGCCCATAGT 


TGACACGCCC 


CATATTTTCT 


ATCAAGATAT 


CTAACCTAGA 


TAGCCCTTTC 


360 


TTTTTACCTT 


GATAAAAAAT 


ATCTTCCCCA 


ATCTCTGTCT 


GATATTGAGT 


TTTAACCCAC 


420 


TGACCATCGA 


CATACAGCTG 


GGCCCTATCT 


CGACCATCAA 


TGATACGAAG 


TCTTTCTTCT 


480 


TCTGCATCCC 


AGTTTGTTTC 


TGTTCGATAA 


AGTAGGTAGC 


CATAACTTTG 


TCCCAGCTCC 


540 


TCCATCTTTT 


GAGGATAGAG 


ACTTTCTACA 


GGGACTTGAC 


AAGCTATCTA 


AGGTTTCAAA 


600 


CAAAGAAACT 


TTTTCAACTA 


GTGGAATAGC 


ATCCAACTCC 


ATACTCTCTT 


TGTAGAGTGG 


660 


TTCCAACTGC 


GGATACTCTG 


AAAAATGTGT 


TGCCATCATC 


TTCTTGACTG 


CAAGATATTT 


720 


AGCAGTTGGA 


TTTCCTTCTT 


CATCCAGAAG 


GGCATCGTAA 


TCATAAGACG 


TTAACTTGTG 


780 


GCAGGTCCAA 


ATTCCTCGAG 










800 



(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



CTCACGCAGT TCTGTTACCA AAGACCGTTC AAAAAGCAAA CTAGCAATAC TTATAACTTA 60 

TTAGAAATTC CTAATTTTAC CTTGCTAAAC GAATACAGGT CACGATGCCA TCTGAGTCTG 120 

TGATTTCCAA TTGGCTCGAT GTCATCCATT TGATTGGTAC GTCAACTTCT TGTGCTCGTT 180 

GGGCAATCGT TAACAGTTCT TCTTTATGTG CAACTTCGAT GACATAGTAG GCTAAACCTG 240 

GTAAGCCTTG CTTACGCGGA AGCCAGACCT TTTCCTTCCC CATTCGTTAA CTGCTAAATG 300 

ATGATGGTAA TCTCCAGCCG CAATCCAACT AGCGCCAGGC ACACTGAATT TATTCTCTAG 360 

CCCTAACACC TTTTGATAAA ACTGGCTGGA CTTTCGACTA TCCTTGACGG AAAGATGAAT 420 

ATGCCCCATT CTTGTACCTT CTGCCAGGAT AAAGGGCTCT ACTCTTTCCC CCAACTCATA 480 

AATGTCCTGC GCCGCAAGAG CCTCCGTCAC TCCGATAATG CGTCCATCTT CTCGAATATC 540 

CCATGTGGAA ACTGGCTTAT CTCGATAGAG TTCAATGCCA TTTCCCTCCA AATCTTCCAA 600 

GTAAATAGCT TCACTGTAAC CATGGTCTGC ACCGCCGACA AGAGGAATTT GCAAATCTGT 660 

CAGATGTTTC AAGACATCAG CCAAAGCCTT GCGTGTGGGC AAGAGAATGG CCAAATGGTA 720 

AAGACCATAA TGTTCCCTCA CTTCTCCGCT CTTCTTGTGC TTGAATCAG 769 

(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: rnir leic acid 
<C) STRANDEL.VESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TTACCCCAAG CCTGCAACTT CCCAAGCTGA TTGGTTGAAT GATTTGTCAT CAACACCACC 60 
AGTATCAGTG ACGATTGCTG CTTTTGTCTT CACATCAGAA GATGAAGCTG CGTTACGAGA 120 
AGAGCGGTTA CCACATGCAG CAAGTCCAAC TGCTGCCACT GCAACTAGGC CAAGACCTAG 180 
CCATTGTTTC TTGTTCATTA CTGAACCTCC TAAATAAGAT GTGCAACGAT GTTGCAAGTA 240 
TGGATTGATT GGTCACAAAG ACCCTTGCCA CTCAAAGAGC GACTCAGACT AATTTAAGTC 300 
TGTAAAAGAA TATGGAAGTA ACTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC 360 
GACTAAGGTC ACTTTTAAAT CTTGTTCAAA AAATTCGACC ATTACTTGGC GACAAGCACC 
ACATGGTGAA ATCGGTTTTT CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG 
GCCTTCAGAT ATAGCCTTAA AAATAGCTGT TCTCTCACCG CAATTGGTCC AAAGGATAGC 
TAGCATTTTC GATATTCACT CCCGTGTTAA ACATTTCCGT CTTTGGCTAC TAAAACTGCT 
CCGATAGGAA AGTGAGAATA GGGGACATAG GCATGTT 



420 
480 
540 
600 
637 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 940 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CTGAGGATAA TGACATCTGC CANAGGAGCA AAACTAGCAA AATCATCNGT TGCACGGTCA 60 

ATCATGCCTT CTTTCAAGGC GATATCTCTC GAAGCTTGAC TACGGATTAT AACCTAAAAT 120 

TTCATAATCT GGATGATCGC GTTTGATACC AAGTGCCATA GAGGCTCCAA TCAACCCAAG 180 

ACCTGCGATA TAGATTGTTT TTGCCATAGG AACTCCTTAA TAGTTCTTTG TATAGTCTCG 240 

GTGTTTGGCT ACCGCTTCTT TTAGTTCCTC AAGATTATCT GATGAGAATT TTTCGAGGAT 300 

TTCTTGCGCC AGAACCGTTG CTACAACTGC TTCCATGACC ATTCCTGCAG CTGGAAGAGC 360 

AGTCGGATCA CTTCTCTCCA CGGTTGCCTT GTAAGGTTCG TGGGTTTCGA TATCCACACT 420 

CATAAGAGGT TTATAAAGAG TAGGAATGGG TTTCATGACC CCACGAACAA CGATGGGTTG 480 

CCCATTAGTC ATACCACCTT CAAAACCACC TAGATTATTG GTACGGCGAG TATAACCGTC 54 0 

TTCTTTAGAC CAGAGAATTT CATCCATAAC TTGGCTGCCT TTACGATAAC CAGCCTCAAA 600 

GCCAAGACCA AATTCCACCC CTTTAAAGGC ATTGATAGAG ACAACAGCTT GAGCCAATCT 660 

TGCATCCAAT TTTCTATCCC ATTGGGACAT AGGAACCAAG ACCAACTGGG AACGCCTCCG 720 

ACGACTGTCT CCACAACCCC ACCGATGGTA TCACCATCAC GTTTGATTTG GTCAATA r AG 780 

TCCTTGATTT CCTGTTCTCG TTCTTGGTTG ACAATAGAAA ACTTCCAGAC TGGGCA^CTC 840 

TTTGCTTTAA TTTCCAGCGA CTGTCAGATT TTCCAGGAAC ATCCATTCCC TTGCCACCAA 900 

AGACCACGAC ATGGTTGGCA ATCTCCATAT CCAGATTCGA 940 



(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GATTCTACAG 


GATCCATTTT 


ACTATCTTAC 


GCGCCGGGAA 


GTAGGCTGAG 


ACATAACCAA 


60 


GTAATAGAGC 


CAACAACTAG 


AGTTCCTAAA 


ACAGATAAAA 


GATTTAATTC 


AAAAACCTTA 


120 


GTGATGGATG 


GGTAAAAGTG 


ACTTACAATC 


GCATTCGCCA AACTTCCCAC 


CCCTTGTGCA 


180 


ACCAAAAATG 


CCAGCAGCAA 


GGCGATGCCT 


ACAATCCAGA 


TAGCCTCGTA 


AATAAAAATT 


240 


CCTTTGACAT 


CACGATTCTG 


ATAACCAACT 


GCTTTCATGA 


CACCTATTTC 


CTTGGAACGT 


300 


TGCATGATAT 


TGATGTAAAT 


AATGATACCA 


ATCATAACCG 


CTGCTACCAC 


AATAGCTTGT 


360 
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GATGAAAGCA 
ACTCTCTGTT 
TTTGTCTGTG 
GCCTCTTTCA 
TCATCTTCAT 
TCGGCAGCAC 
AAATTTTTTC 
CTGTTCATTT 
AACAGAGATA 
CTCAAATCCA 
TCCCNAAAGG 



CAATCAATAA 
AAGAAAGCAC 
ATGGATCTTT 
AAATCGTTTC 
CATTGATTAC 
TTTCTACAAT 
CTGTCTGTTC 
GTATCAGTAT 
CTTGTATCAT 
TTTCTTGCCC 
ACTCTCCAAA 



TCCCTGAATA 
AGTATACTTC 
GAGTTCCAAG 
CATTTGATGA 
ACGTACAATC 
GCTGACTGAG 
ATTCCAATTT 
AGAGGGATCC 
CATAAAGACT 
ATCTTTTCTT 
TATNNNATAG 



ACACGAATAA 

TTATTTTTCT 

ATAAAATAAG 

GACAGCATGA 

TTCGTTTGAA- 

ACTGATTTGC 

TTTAGTAAAG 

AGCCAACACT 

CACTACTTGA 

GCCCATCTAT 
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AGGTAATCAC AATATCAAGA 420 

GTAATTCTTC TGTTACTACT 480 

ATACAGCTTT CGTAAATCCA 540 

AACTGTTGCT GTCCTCCATG 600 

ATTGAGCAAT CTTACTAGTT 660 

CAATAAGATC ATTAGCTGTC 720 

CTGCTTGGAA TCGTTAATCC 780 

TTGTCCGTCT CATTATTACT 840 

GCATAAGAAG CATCGTTTGA 900 

AGTAATATTT GACATGTTCA 960 

990 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



ATCGGCTAAG 
AAGATGGCTA 
ACCAAGGTGC 
GAGGCTGCTA 
GGAGTCCCAC 
GCAGGTGGGG 
GATGGTGATA 
CAAGCCTACA 
AAAACGAATA 
GGTACTTTCT 
GTGGAGGACT 
TGTAATACTA 
AAATGAGGCT 
CGATAGCTAA 
GAATGGCCTC 
GTTCATCATG 
CCTGAATGAT 
CCATTCCTTG 
TAATACAGGT 
ATTCTTCAAA 



ATGCGTGGAG 
TGGAGTCAGG 
GTGAAATCCT 
TAGAACAGGC 
ACTATAATAA 
CAGCTTCTGC 
TCAATCCAGG 
AAGATATGGA 
GGAGTTTTAG 
CTTAATGTCA 
TCCTTGTTAA 
AGAGAGGGGA 
GACGCGATCT 
ACTATCGCTG 
CTTCATTAAG 
ATAGATTCCC 
TTCTTCTTGG 
ACTGAGGAAA 
ATGTCCCAGG 
GGCAGAAATC 



CATCACAAAT 
TGCGACAGCT 
CGGCGGAGGA 
GCTAGGTGTT 
TCGTGCTCTT 
TACAACATAC 
TCGCGTCTTT 
TGAACGTAAG 
AACTCTATTC 
GTCTGGTTCC 
GAATATCCAT 
GGATAGACCT 
GGCAGGCTGA 
GCTGCGAAAA 
TCATAGCCAG 
CTCGCTTGAC 
TCTGTAGTTT 
TAATCGACAA 
GAAAGTGTAT 
TGAGCTTCTT 



TATCCTTATG 
GTTGTTGCAG 
GCAGACGCAG 
CTTCATAATG 
GGTTCGACAT 
GATAAGCAAT 
ACTTCAAGTT 
ACCATTAAGT 
GTTTTTTATG 
CAGCATGGTC 
ACCTGCTCGG 
GTTTGGTCAG 
TTCCAGCTTC 
ATGCTGGCGG 
ACTGGGCAGT 
TGTAGTTTTT 
CTTCAAGGCC 
CCTGTTTCAT 
CGCTGTCTAG 
GATAGGATTG 



AGCCGTCATG 
AACGTGGTCA 
CACTTGAATG 
GAGGGCGTAT 
TTATGCAAAA 
TTTTACTAAA 
ATAAACTGGA 
CTATGATTGT 
TTATCCTATT 
AGGCTAGGGA 
CCCATTTCTT 
ACTAAGTGTC 
TTGGAGGGCA 
AAGTTGGTCT 
AAATCTTCCT 
GAAGTTTTCT 
TGTTAGAATC 
AGCAGTGTAA 
AAATACAAGA 
ATTCTGCTAG 



AAGACGTCAA 
AGAACGAATT 
TGTTGGTACG 
GGGCTTTGTA 
TATCTCTGTA 
AGCCGTCCTT 
AGATATCGAC 
AATCGAATAA 
CTTGATTTAA 
TTTTCCGACC 
CAGTATAAAC 
GTTAAAAGGA 
CGGAGGGCAC 
CCCAAGCTCT 
TGAAAGACCA 
AGACGCTTGT 
CCGATACGGT 
AAATCCGTGA 
GGCTTTTGGT 
GGATAACAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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CAAAGGCAAT CCTCCCTTTA TCCACCTCAT ATTTGAAACC ATTTCAGCTC TTA 



1253 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 



CGTGTACCAT 


CAAGAGGCTA 


ATTGATGTTT 


CTTCCCCCTG 


TGCCTCTCTT 


GTCAACATTT 


60 


CTGTTTGAGC 


CTTATGCGCC 


TGTAATAAGG 


CAGCATTCGC 


ATCAGCTAAT 


CTTCGATTGG 


120 


CTTTAGAGAA 


ATCCCCTTTT 


TTTGCTGCCT 


GAATAGCCTC 


CATAGCATTA 


CTTTTGGCTT 


180 


CCCCACCATA 


CATAATTAGC 


CCCATTACAG 


ATTCTAAATT 


ACTTTCATCC 


ATAACATTCT 


240 


CCTCAATCTT 


AGTCTAATAA 


ACTTTCAGCT 


AAATCTAAGA 


CTTTAGAACC 


ATTCATCATT 


300 


CCATAATCTG 


TCATCGGAAT 


AACCGCCACA 


GGAATCTGTC 


TATCTTTTAG 


TTTTTCTTGA 


360 


AAGTCCCCTA 


GTAAATAGCG 


AACTTGAGGA 


CCTAAAAGCA 


ACACATTTAC 


TTCTTTTGTT 


420 


GCTACAATTT 


CTTCTGCTTC 


AGGAGCTGGA 


ACTGCAAAAA 


TTTCT CATC 


CAACCCCTTA 


480 


TCTTCTGCTG 


CCTTTTGCAT 


CTTTGTCACT 


AGCAGACTTG 


TACTCATACC 


TGCTGCACAT 


540 


GCTAACATAA 


TTGTTACTTT 


AGCCATAGCT 


TACTCCTTTA 


TCACCTATTT 


ACTGACCTAG 


600 


CCAAGCGAGA 


TACTGTATGT 


CTCACTCCTC 


TAATTGCCTT 


GCTCAATGCA 


TAAATATTAT 


660 


CGACTGTTGC 


CAGTCCCCCA 


TATCCTGCGT 


CACCTATATG 


TTGAATATCA 


ACTCCACAAA 


720 


TTTTATTTCT 


AAGTGCAATT 


TCCTTAATAG 


TATCTGTATC 


AGATGTTTCT 


TGGCTAGTAC 


780 


CAATAGCACT 


CAATACAAGA 


CCACCCTTAC 


TATGAACGAG 


ATCAACGACT 


TCACGCAACT 


840 


CTTGGTCATG 


AAAGGCTGGA 


ACGGTTCCAA 


CTGCTGGAAC 


AAGTATCACA 


TCCGCACCTG 


900 


CTTCCAACAA 


TTGCTCTGCG 


ACAGAAAGCT 


CTGCCACAGG 


CTCATTCACT 


CCTGCCCCGT 


960 


GCATCTTACC 


TGCAATAATC 


AAACCAGAAA 


AGTTTTCCTT 


AGCAGTTTGA 


ACAGCCTTAA 


1020 


TGATTTCTCG 


ATTGCTAACT 


CCTGTTCCAG 


GATTTCCAGT 


CAAACAGACA 


AAGTCAAAAC 


1080 


CTAACTCCTA 


TATTCGCTTC 


AATGTTTCAA 


CACTAGCAAC 


ACGACCTGCA 


ACAATTTCCT 


1140 


GTGTTTCCTC 


TAGCATTCTT 


TGCAGATGGG 


TCAATCGGTT 


CCAAATTTAC 


ACCAATTGGA 


1200 


CAAGCAACAA 


G 










1211 



(2) INFORMATION FOR SEQ ID NO: 62: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



94 



WO 97/37026 

(ii) MOLECULE TYPE: Genomic DNA 



PCI7US97/0S3O6 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CCACACTAGC AGCCTATGGA CTCACTAAAA GAAAAGAAGA CTAAGTCTTT TCGATAAAAA 
ATAAACAGCG AGATTGAAGC TCGCTGTTTA TTTTTTAATT AATCACCTAG TCCAAGACGT 
TCAAAGATAT CATCCACTCG TTTGGTGTAA TAAACTGGGT TGAAGATTTC ATCGATTTCT 
TCTTGTGTGA GACGTGATGT TACTTCTGAA TCTGCCTCAA GAAGTGGTTT AAAGTCTACT 
TGGTTGTCCC AAGAGTAGGC TGTTTTTGGT TGCACCAAGT CATAGGCTTG CTCACGGGTC 
ATGCCTTTTT CAATCAATGT CAACATAGCC CGTTGGCTAA AGATAAGACC AAAAGTCGAG 
TTCATGTTTC GGATCATATT TTCTGGGAAG ACTGTCAAGT TCTTGACGAT ATTTCCAAAA 
CGGTTGAGCA TGTAGTCAAT CAAAATGGTC GTATCTGGTG TGATGATACG CTCAGCTGAT 
GAGTGAGAAA TATCGCGTTC GTGCCAGAGA GCGACGTTTT CATAAGCCGT AATCATGTGA 
CCACGAATGA CACGCGCCAG ACCAGTCATA TTTTCAGAAC CGATTGGGTT GCGTTTGTGA 
GGCATTGCTG AAGACCCTTT TTGCCCTTTA GCAAAGAACT CTTCTACTTC GCGTTGCTCA 
GATTTTTGTA GACCACGAAT CTCAGTCGCC ATACGTTCGA TTGAAGTCGC AATGCTGGCA 
AGAACCGCAA AGTACTCAGC GTGAAGGTCA CGAGGAAGGA CTTGTGTTAA AGATTCCTTG 
GGCACGGATG CCAAGTTTAT CGCAGACATA CTCCTCTACA AATGGTGGGA TATTGGCAAA 
GTTCCCAACC GCACCAGAAA TCTTACCAGC TTCTACACCA GCAGCCGCAT GCTCGAAGCG , w 
CTCCGATATT GCGTTTCATT TCGCTGTACC AAGTTGCTAA TTTAAGGACC AAAGGTTGTC 960 
GGCTCAGCGT GCACACCATG AGTTACGCCC CATCATGATG GTGAACTTGT GCTCCTTGGC 1020 
TTGTCAGCGA TGATATTAGT GAAGTTTTCA AGGTCACGAC GGATGATGTC GTTGGCCTGC 
TTGTAGAGGT AACCATAAGC AGTATCCACC ACGTCGGTAG AAGTTAACCC ATAGTGAACC 
CACTTGCGC? CTTCACCAAG AGTCTCAGAA ACCGCACGCG TGAAAGCCAC CACATCGTGG 
CGCGTCTCCT GCTCAATTTC CAAAATACGG TCGATGTCAA AGTCCGCCTT CTTGCGAATC 
AAAGCCACAT CTTCCTTAGG GATTTCCCCA ACTCAGCCCA TGCCTCGTCA GAGAGGATTT 
CCACCTCAAG CCAAGCACGG TATTTATTTT CTTCACTCCA AATATTCGCC ATCTCAGGGC 
GAGAGTAACG GTTGATCATG TGTTAATTTT TCCTTTCTTC TTAAG 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



CTATTCTGAG TATAAAGTTG GCGGAGGCTG GAACTACGCT CGTTATGAGG TCATAAACTA 
CTATACTGGA GGTTATTAAT CCTTAAAGAG TGAGAAAAGG AGGGCTGGAT ATGTTAAATC 
TTACTCATGT TACCTTAAAA ACGCGACAAG TCATCTTGCA AGATGCGGAT TTTACCTTTA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1080 
1140 
1200 
1260 
1320 
1380 
1425 



60 
120 
180 
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AAAAGGGTAG GATTTATGGC CTTCTTGCTA 
GAGCTATGAG CAAGTTGCTT CCCCTTAGTA 
TTTATTATGA GAGCGTTGAA TGGCTGGATG 
TCATAAAAAA ACATCTGGAA GTCAGACCTA 
ATGGCTGACT ATATCAGTCT TCCCATCCGC 
GTGATTGCTA TGTATTTTCT CAGTCAGGCC 
GGCTTAGACG AGTATTATAG ACAGAAGTTT 
GAACAGCTGG TTCTTTTAAG TTCCCACTAT 
GTAGTAACCA TTCATCAGGG GCAGATAGAA 
TTTTTATTGA AAAAAGTTTT CAAAAGTCGT 
TCTGTACTCG GTGTTACCTT TTATTTAAAT 
AGAGAGTTGG AAACTCGTCT TGTAGACCGT 
CTCTCCCAAA TGTCTGATAC CAGCTCGGAG 
GTGCAAAAAA ATCTTTTGAC GCGAAAGACA 
TGGAAAGAGG CCTACTATCT GCAGTGGCAA 
AATGACCCGA CTGCTAGCTC TGGCTTAAAA 



PCIYUS97/05306 



TCAATGGCTC 


gggaaagacg 


ACACTATTCC 


o a n 


GTGGACACAT 


CGCAGTTCCT 


CCTTCTTTGT 


300 


H ^ % MflMi m m 

GAAACTTAAG 


TGGGATGGAC 


TACCTTCGTC 


360 


AA(Ji 1\»AGAG 


ATvjAAATCGC 


CTACTGGGAA 


a o n 


AAGTATTCCT 


TAQuQATGAA 


/■•/■•a a o^fTvrv* 

GCAACGCTTG 


4ovJ 


AAA I\jV_ 1 VjvjL. 


Hi/"** a rnrv* A TV* A 
1 V- A 1 ouA 1 VjA 


A fTvn A A A A T» 

GATTACAAAT 




TTTGATAGGC 


TAGGACAAAT 


O/"* A fft A/ 1 * A ^ A A 

CGATAGACAA 


600 


AAGGAAGAGT 


U\iGTTGATAT 


CTGCGATAGA 


£ c n 

660 


GAGGTTTAGT 


TTATGAAAGA 


TGTTAGTCTA 


720 


TTAAACTGGA 


TTATCTTACT 


TTTATTTGCA 


780 


AGTCAGACTG 


CAAACTCAGT 


CAGCTTGGAG 


840 


GAGAGAGTCA 


TCAATGAAAA 


TGAAGAGAAA 


900 


GAATACCAGT 


TTGCTAAAAA 


TAATTTAGAC 


960 


GAAATTCTGA 


CTTTATTAAA 


AGAAGGGCGC 


1020 


GATGAAGAGA 


AGAATTATGA 


ATTTGTATCA 


1080 


ATGGGGGTTG 


ACCG 




1124 



(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



CAGAGCAACT 


TCGTCAAATC 


TTCATAGGTG 


AGGTAACCAA 


TTGGAAAGAG 


GTTGGTGGTA 


60 


AGGACTNACC 


CATCTCTGTT 


ATCAATCGGG 


CAGCCGGCTC 


TGGCTCTCGT 


GCTACCTTTG 


120 


ATACTGTCAT 


TATGGAAGGT 


CAGTCTGCCA 


TGCAAAGTCA 


GGAGCAGGAT 


TCAAATGGAG 


180 


CGGTAAAATC 


AATCGTATCA 


AACAAGTCCA 


GGAGCTATCT 


CTTATTTATC 


TCTTACCTAT 


240 


ATAGATGATT 


CGGTCAAAAG 


CATGAAGTTG 


AATGGCTATG 


ACTTAAGTCC 


AGAAAATATA 


300 


AGTAGCAATA 


ATTGACCCTT 


GTGGTCTTAT 


GAGCATATGT 


ATACATTGGG 


GCAGCCCAAT 


360 


GAGTTGGCTG 


CAGAATTTCT 


CAATTTTGTT 


CTCTCGGATG 


AGACCCAAGA 


AGGGATTGTC 


420 


AAAGGATTGA 


AGTATATTCC 


GATTAAGGAA 


ATGAAGGTTG 


AAAAAGATGC 


TGCCGGAACT 


480 


GTGACAGTGT 


TGGAAGGGAG 


ACAATAATGA 


ATCAAGAAGA 


ATTAGCTAAG 


AAAATGTTGC 


540 


TTCCATCAAA 


GAATTCTCGT 


CTGGAGAAAT 


TAGGAAAAGG 


TTTGACCTTT 


GCCTGTCTTT 


600 


CTTTGATAGT 


CATCCTTGTG 


GCCATGATTT 


TGGTTTTCGT 


AGCGCAAAAA 


GGCTTGTCGA 


660 


CCTTCTTTGT 


CAATGGTGTG 


AATATCTTTG 


ACTTTCTTTT 


GGGAGGAACT 


TGGAATCCTT 


720 


CTAGTAAAGA 


ATTTGGTGCC 


CTTCCTATGA 


TTTTGGGTTC 


CTTTATCGTT 


ACCATTCTCT 


780 


CAGCCCTTAT 


CGCAACACCC 


TTTGCTATTG 


GTGCAGCAGT 


TTTTATGACC 


GAAGTATCAC 


840 


CAAAAGGGGC 


GAAGATTTTG 


CAACCAGCTA 


TTGAACTCCT 


GGTTGGGATT 


CCTTCAGTAG 


900 
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TGTACGGATT TATTGGCTTG CAAGTCGTCG 
CTGGTTTTGG GATTTTGTCA GGGATTTCCG 
CCTTTATGAC AACGGATAGC TTGCGTGCGG 
GCTATGGGA 

(2) INFORMATION FOR SEQ 



PCIYUS97/05306 

TTCCCTTTGT TCGCAGTGTC TTTGGTGGGA 960 
TCCTCTTTGT CATGATTTTG CCGACCGTAA 1020 
TTCCTCCNTT ATTATCGTGA AGCCAGTTTC 1080 

1089 

ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



CTNTNTTTAA 
AAAAAAGGGC 
AGCCTAATTC 
CTTCAGACCA 
AGATGCGGTC 
GCTGCAACTG 
CCAAGGCGCT 
GTAAGGCTAC 
CTGTCAGCTG 
CCTTTTGAAT 
GATCCTTTTG 
TTCTCCCTGA 
CACCACTTGG 
CTAAAATCCT 
TTTCACAAAT 
TTTATCCCAT 
GTCGCTTGCG 
CTTGAGTTCA 
ATATTGGTAG 
CAGATAGTCG 
TTTCAANTAC 
GTAGTAAATC 
NTTANTTGGT 
CAACATACNN 
GGACTGCCTA 
AG 



ACCTTGACCT 
GCTGGTAAAA 
TGCCAAGATT 
GTCTAGTTTA 
GCTGAGANTG 
CTTGTGAATC 
AAAGGCCTCA 
ACGCTGGCGC 
GAAGGTCGCA 
CAAGAGGGGC 
CAACATATAA 
CTGAACTTCT 
GCCTAGGATG 
CTCCTGACCA 
TCGTTGGTGA 
TTATAGAAAG 
TATTCTTTTG 
GGTGCATTCT 
CCTTTTNTGA 
TTGTTTGCGA 
ATGAAGTTAG 
CNAGCAGTAT 
ACTNTTTCAA 
GTTCANTTGG 
CATTNTCACA 



TANTTGANNA 
GGGATAATCT 
TGACGTTTGT 
ATTTCTGAAA 
AGGGCCTCCT 
TCAAGATACC 
TCTAAGAGAA 
ATCCCACCGC 
AGAATTTTAT 
AGAATGATAT 
CTCACGTGCC 
AAAATCCCAG 
GAAACCACTT 
TAGGCATAAC 
AGCCTTTGTC 
CATTCCAGCG 
ACAAGTATTT 
TGATGAGAAT 
TGGCTTGGAT 
TGATAACTGG 
CATCTACACC 
CAAAGACGCC 
CCTTCTCACA 
GTCATTCCAT 
TTACCGAGAG 



TTATACCGAA 
TCACCAACTC 
NGGCAATCTT 
CAATCTGCCC 
CAATACTATG 
AAGCGTGGAG 
AGAGCTTGTG 
TAAGTTCATG 
CCGCTCGGGA 
TTCCAAGCAC 
CCTTGGGATT 
CGATTAGATT 
CGCCTGAAGT 
TGACGTGCTC 
TGTCAAGTCT 
AGCTGCGTCA 
TTGAGATTCG 
ATCTGCAGCT 
GACTTTGCGA 
TGAATATTAN 
TTGAGATTTA 
ATTGGCAATC 
GTCTCCNCCT 
GTCCCATATT 
ATGATCCCTG 



ATTNTCTTCA 
CCTATNTTTC 
TNGGACTTCN 
AGGGNGATTT 
CGTGANGATC 
TTCCATCTTT 
CCCAAAAAGG 
AGGATACTTG 
AATAGCTTCT 
CGTCTTGTGC 
TTCTTCACCA 
AAAGAGGGTG 
CACCTGTAGG 
TAGTCTAATT 
TCTTTAAGGA 
AATTGACCCC 
ATGACAAAGT 
TCTNCTGGAT 
GCTTCTTCTT 
TCAAACTCCT 
GCAAGGATAC 
GGTGTGATTG 
TGAGATTCTA 
TCTTACCAAC 
ATGTATTGTG 



TTTTTAAAAG 
TACTTATCTA 
TNGTCCTCAT 
NTCAAGATAT 
AGGGTTGTTA 
GTCATCTCAT 
TAGGTCCGGA 
TCTCTTACAG 
GCCTTATCCA 
TCCAAGAGCA 
TCAAGGACAA 
GTCTTTCCAA 
TTGATATCCT 
TCTGTCATTA 
TACCATTTTC 
ATTTTTCCTT 
CACGTTTTTC 
GTTCCATGGC 
TGTTATCTTT 
TGACATAGTC 
CATCCCAACC 
AGTTTGAGTC 
CCAAGGTTTT 
CAAGTCTTTT 
TTCCACCNAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

132C 

1380 

1440 

1500 

1502 
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(2) INFORMATION FOR SEQ ID NO: 66: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



ACTCTTCAGC 


GATTTCTTGG 


CTGAGAATGT 


CCTCATAGGC 


AGGGAGTTGG 


ATGTCCTGAC 


60 


CATATTTCTT 


CTTGAGGGCA 


GTAGTAACTA 


GGCGATAAGC 


CAGATTGGCA 


TTACGAGAGA 


120 


AGATAGGCCC 


GNGGAANTAC 


GAACCAAAGA 


CATTCTTATA 


ATGAACCCCT 


TCACCGACCT 


180 


TTTCTTCGTT 


GTTTCCATTT 


CCATAGACAA 


CCTGTCCCAG 


CGGTTTTTGG 


TCATCAGAGA 


240 


GGAAGGTACG 


ACCTTGGTGA 


TTTTCAAATC 


CATAGTAGGT 


TTCATCGAAA 


TCTTTCATTG 


300 


TGAATCTTGA TGTCACCGAT 


AAAACGGTTA 


TTGGTCTGGT 


TGAGCGTGTA 


GTGTCCCATG 


360 


ACCCCTAGCC 


CTTCGATACG 


TTTTCCTGAA 


GCTTCAACAT 


AATATTGACC 


CAATAGTTGG 


420 


AAACCACCGC 


AGATAGCCAG 


AACTACACCG 


TCGTTTTGGA 


TGTAGTTGTC 


AATGCTCTCT 


480 


TTTTTAGCAG 


GTAGGTCGTC 


TGCAATGATA 


CTTTGTTCAA 


AGTCTTGACC 


ACCACCGAAA 


540 


AAGGCGATGT 


CGTAGTGATT 


T" r ATCAAAG 


TCATCATGGA 


GAGAAACGAT 


GTCAACGGTC 


600 


ACATGGGTTC 


CCAGTTTTTC 


AGCCACATAC 


TTGAGCATGA 


GGATGTTTCC 


ATTGTCCCCG 


660 


TAGGTATTCA 


TGAGATTTCC 


GTAGAGGTGG 


GCAATGTTGA 


GCTGATAGGG 


GTAATTGCCA 


720 


TCTTTTGAGG AAAGTGAAGT 


ATAAACCATT 


AGTTCATCTC 


CTTTCTAACA 


ATCTGACGAC 


780 


TAGCCAGCAG 


TTCACGAAAT 


TCCAGCATGG 


CAGTATAAGT 


TGCCAGAATA 


TAGGCATGCT 


840 


TGCAGTCTTG 


ATTCTCAATG 


GTCTTGAGAA 


CTTGCTCCAG 


ATTACTCGTT 


TCAGTGATTT 


900 


TCTCAGCTGG 


ATAGCCAGTC 


ACTCGGAGGC 


GACGAGCGAT 


TTCAGAATGA 


CGAACACCGC 


960 


CAGCGTTGAT 


TTCAGGAATG 


TCCATGTCAG 


TGATTTGCTC 


AAAGTCTGCA 


TCCCAGATCC 


1020 


AG 












1022 



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CCTCCAGGTC TCGGGAAACG ACCATGGCCT TTGTTATTGC CAACGAACTG GGAGTCAATC 

98 
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TTAAGCAGAC 
AGTTAGATCC 
AAGAGGTGCT 
AGGGTAGTCG 
GGGCTGGTAT 
ATTATGCCCA 
AAATCACTCA 
CCAATCGTCT 
ATGATATTAT 
ATGTGGATCA 
TAGGAACTCT 
CCTTACTTTG 
GCCTAAGGCA 
GAGGCTTTTT 
TCTGAAAGAC 
AGACAAATCC 
TTTTCTTATG 
CAGTGGGAAT 
ACCAGTTCTC 
TTGAATGAAG 
TTCAAGTTCG 
CACGTTTATC 



GTCGGGTCCA 
TGGGGATGTA 
TTATAGTGCT 
TAGTGTTCAT 
GCTCTCCAAT 
TGCTGACTTG 
TGAGGCAGCA 
CCTCAAGCGC 
TACCGATAAG 
AAAAATCCTT 
TTCTGTTGAA 
ATTCAAAAAG 
TATGAGCACT 
AGAGAAAATC 
TCGTGGTTGG 
CCTTTTGATC 
AGGAAACCTT 
TGAAAAATCA 
GTGATGCTAT 
AATCAGATTT 
TCCAACTCCT 
CAAGAAAAAT 



GTCATTGAAA 
CTTTTTATTG 
ATGGAGGACT 
TTGGAGTTAC 
CCGCTACGGG 
ACAGAAATTG 
TCTGAGTTGG 
GTGCGCGATT 
GCTTTGACTA 
CGTACCATGA 
TATCGCCGAA 
GTTTTATCCA 
TAGGTTATGA 
CAGATATGAT 
CAGCAGGTTC 
ATGAAACAGA 
ATTACTGGAG 
GGTCTATATG 
GAGTAAGTAT 
TGAACTCTAT 
CATTTCTTAG 
TGGCAGGAGA 



AAGCCGGAGA 
ATGAGATCCA 
TCTACATAGA 
CACCTTTTAC 
CACGTTTTGG 
TCGAGCGGAC 
CCCTACGTAG 
TTGCCCAGAT 
TGCTGGATGT 
TTGAGATGTA 
GAACGTGAGA 
TGCGGACACC 
ATACAGTGAA 
GGCCATTCTG 
TGTCAGAAAT 
TATAGATGTT 
AAAAAGCTGT 
CATCAGCACA 
CCAGAACGGT 
GCACCTTATG 
AAAATGAAGA 
AAT 



TCTGGTAGCT 
TCGTTTGCCA 
TATTATGATT 
CTTGATTGGT 
GATTACAGGC 
GGCAGATATT 
TCGTGGGACC 
AATGGGGAAT 
TGACCATGAA 
CAGTGGAGGA 
CAGTTGAAGA 
GTCCTGGACG 
AAATAAGCAA 
ACGATCATCC 
TTCATTTGGA 
GATTTTCTTT 
GAGAGGATTT 
GTCCTCACAC 
GTACGACAGT 
GTTTGGAGGA 
CCGAATGGAA 



PCT/US97/05306 

ATTTTGAATG 
ATGTCAGTGG 
GGGGCTGGTG 
GCGACGACTC 
CATATGGAGT 
TTTGAGATGG 
CCTCGTATTG 
GGGGTAATTG 
GGTTTGGACT 
CCTGTTGGTC 
TATGTATGAG 
GGTGGCGACT 
GAAATCCTTA 
GAGACCTTGG 
ATCTTTTGTC 
GATCCCAGAT 
TCCTCAGTAC 
TGCTTCCTAT 
TGGACTGCGC 
TATTTTGAAT 
CTCTATCAAA 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 c . c 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



CTCCTGGAAA 
ACAATAAGGT 
GTTCTTCCAA 
GTTCAAAAGC 
GATGGTCTGA 
CATTACCAGA 
CTTGACGATG 
ATTTTCAGGG 
TTCGTGTTTA 



TTCTGGCAAG 
TGGTCCAACG 
GAGTTCGACA 
ATAGGACAGA 
AAAGACAAAC 
GGCAAAAACC 
CGACCAATTT 
CTGACCGCCA 
ATCTGACCAT 



TAGGCTGGAT 
ATTTTCATGT 
AGGGCTGCTT 
ACGCCGAGCT 
TCCACTGGAA 
GCTATTTTTT 
CATAGACTGC 
ACATAAGTCC 
ATTTTTCAAG 



GAATGTTGAC 
AGCCTGCTAG 
CGTAGTCTGT 
GCTTTGCACG 
ATTCTTCGGC 
TCATTTGATA 
TTCATCCAAC 
CACACCCATA 
GGCTTTGAAA 



AATCCGACCT 
GCAAACCAAG 
CTTGCTCTCA 
CTCAAGCACA 
AATCACCTGA 
ATGACACTTT 
AATTCTTTTA 
TTGAAGATTT 
ATTGGAAGCA 



TCATAAGCCG 
TCAATCTGGT 
AACTCCTTGA 
TAGGCATCAC 
AAATTTGAGC 
CGTTTTCTTT 
CACGCTCTAC 
CAAACATTTC 
CTGGAACTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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GCTTTCATCA ATTTCAGCAG 



FCT/US97/05306 



560 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CTTCAACACT 


TTATTTTTAT 


CTTTCTCCCA 


TTTCGTATGC 


TAGTTCGGAA 


ATTGTTGCTA 


60 


GAACCTCTCC 


GACTATTTGG 


GATGTTCTCA 


TTGCTTTTGT 


AGGAGGGATA 


GCAGGTATTA 


120 


TTGGTGCTAG 


GAAAAAAGAG 


ACCAATAATA 


TTGTTCCTGG 


TGTTGCTATT 


GCAACCGCCT 


180 


TGATGCCTCC 


TCTTTGTACA 


GTAGGTTATG 


CTATTGCTTC 


TGCTAATCTA 


AAATTTATCA 


240 


TTAGGCTCCT 


CTTACCTATT 


CCTCATCAAT 


TGTAGCTTTA 


TTGTCATTGC 


GACTTATATA 


300 


GGTGTTAGGT 


TGATGATGGT 


TAAGAAACAT 


TATTTTAAAG 


ATAATGAAGA 


AGACTCTAAA 


360 


ATGCGTAGGA 


TTTTGCTTCT 


AGTTGCTGTT 


TTGCTGATGA 


TTCCGAGTTT 


CATCTCTGCA 


420 


ACGACTTTAG 


" \ * AGAG AAAC 


GTTGAAAAAA 


GAGTCCCTTA 


AGAAATTTAT 


ATCAGAGCAG 


480 


TTTCAGGGGC 


ATAATATTTT 


GAAAAAAACC 


TATTCTAAAA 


AGACTCATAC 


CCTAAAGCTA 


540 


ACCATTTCAG 


GAAATTATTT 


GACAGAAGAA 


GAACTCGATA 


TGATTTCAAG 


TAAGAGAGGT 


600 


GACTATGGTT 


TAAGTGATGT 


TTCTGTTCAA 


GTTTCACAAT 


TGTCTGATTC 


AGAACAACTT 


660 


AGTAAGGAAG 


AACTGGTGGA 


GTATTTCTTC 


CAGTATATCA 


AGGATAAGGA 


AGCAAAAGAA 


720 


AAGGAAAAAG 


CTAATAAGTT 


TTATACAGAG 


TCTGAGGAGC 


AATAATTTCT 


TGAGAATAGC 


780 


TGGTTTTTCT 


CGTGAGTCTT 


CTATGTATAT 


CAAAGGAAGA 


CTGAGGTTTT 


AAGTATGAAA 


840 


CTTTTCTTCT 


ATTATAGTAG 
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(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 925 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TCTTGGCCAA CTGCATGGAG TTCAGCGGTC AATTTCAACG CACCTGAGAA ACAGACCCCT 60 
GCACCCCTGA AATCTCAGGA GACATGATGG TCTGGATGGA ATCAATAATG AGAAAGTCTG 120 
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GCTGGATACG CTACCACTTC TGCACGAACA CTCTGCATAT TGGTCTCTGC ATAGAGATAA 180 

AACTCACTAT CAAAATCACC TAAGCGCTCT GCACGTAGTT TAATCTGCTC GGCAGACTCC 240 

TCCCCACTGA CATAGAGAAC TGTCCCCACT TGGGACAACT GGGTTGAGAC TTGTAGGAGA 300 

AGAGTTGATT TCCCAATCCC AGGATCCCCA CCGATGAGGA CGAGACTTTC CTGGTACAAC 360 

TCCGCCTCCA AGCACACGGT TGAATTCCTC CATCTCCGTC TTGGTTCGAT TGACATTGAT 420 

GGAAGTCACC TCAGCTAGTT TCATGGGCTT GGTTTTCTCA CCTGTCAAGG ACACACGCGC 480 

ATTCTTGACC TCGGCAACCT CAACCTCTTC CACAAAAGAA GACCAAGACC CACAGTTGGG 540 

GCAACGTCCC AGATATTTAG GGGAATTATA CCCACAATTT TGACATACAA ATGTCGCTTT 600 

TTTCTTTGCG ATGACAAACC TCTTTCTATA TCTCTAACTC ACACTCAATC ACTTGGCAAA 660 

AATCAATCTT CTCATTTGGC ACAAACTGGC GCATGAGCAT TCGATGAGCA ACAACTACCA 720 

CAGTCTGATG TTCTCGATAC TTAAACATAC ATTCTAGAAA CCGAGACTTC ATTTCCGTAG 780 

CTGTCTCATA TTGAATAGGA CTATTAGGAA GCAACTCCCC CTTGTTTTCT AAAAACAGTC 840 
TTCTAGCTGT TTCAAAGTTT TCTATTCCTG TTTTATAGAC CTGCCATTCA TGTAATAAAG 
GCTCTACTCT TAAAGGAAGA CCCGT 



(2) INFORMATION FOR SEQ ID NO; 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

TTCTNGATCA AGTCAAGGAC TGCANGCCAC GACCGACATA GCGTTTGGCA ATGGAAACAA 
CCAAACGAAG ATTGGCTTCC GCAAGACGTT GTTTGGCTTC GATATCGCCA GCTTCAACAG 
CCAGTGCCAA CTCTTTCTCC TCTTCATTGG TCAAGAGAGG AACGACCCCT ATTTCTTTCA 
AGTACATACG GACAGGGTCA TTGACCTTAG CAGAAGTTGA CCCAATCAAG TCCTCATCGC 
TGAGTTCTGG TTCTTCTTCA TTGCTAAGAA CACGCGCACT TGGATTTCCT TCGTTATCTG 
TGATAGAAAT GCCTGCATCC TGAATCCGTT GCAAGAGATC TTCAATCCCA TCAGCGTCCA 
AGGTAAAAGG AATAACCAGA CTTGCATTGA TTTCATCATC TGTTGCTGTC CCTTTTTGCT 
TATGATTACG GATAAATTCT GCTACCTGTA CGTCAAATGT TGTTACTTCT TTTTGTTTTG 
TTGCCATTAT TACTCCATTC TTCTCTTTTG GGAAATTAAA CGTTCCAATT CTTCTAGGGC 
TGTATCTGTA TCTCCTACAT GGCTAGCTTC CTGCACCTTC TTTTTGATTC CCATATTGTC 
CTGATTCAAG AGAGCCTTGT TTCGAAGTCA 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 



900 
925 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
630 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



CTAGGGCCAT 


CTTCTCACCT 


CCTCCTTGTC 


AGTACATTCT 


TGCAATAGAA 


AAAATAAGAT 


60 


TGAGTCCCCC 


CAACCTTAAA 


TTTTTTCACC 


ATCTTCTTTT 


TCTTTAGCAA 


TTTGCTCTTT 


120 


GATTTTCTTT 


TCTTCTTCTT 


CTTTGCGGCG 


TTTTTCTTCT 


TCGATACGGC 


GACGCACTGC 


180 


TTCACGTTTT 


CCTTCTGGAT 


CTGGGTGAAT 


TGTAACGTTT 


CCTGATTCGA 


TTTCTTCTAA 


240 


AGCGCGAAGA 


GTTGATTTTT 


CAGACTTGAA 


ACCTTGAGTT 


GCTGGGGCAC 


CTGCTTCCAA 


300 


TTCGTGGGCA 


CGTTTTGCTT 


CCAAGATTAC 


GAGTGAATAT 


TTTGAAGGAA 


CCTTGTCGAG 


360 


CAAGGTATCA 


ATAGAGGGTT 


TTAACATCAT 


TTGCTTGTAC 


CTATTTTCTA 


AATTTTATCG 


420 


GGTAGTTGGA 


GATTTTGGTA 


ACATCTCCTG 


ATAGTGACCA 


ATGACACGAT 


CCACACAGAA 


480 


GTGTTCTGCT 


TCAATCACAC 


ATTTGACACG 


TTCAGCAGCT 


AGGGGTACCT 


GATCGTTGAC 


540 


AATCGCATAA 


TCATACTCAC 


GCATGAGGGC 


AATTTCTTCC 


TTGGCCTTTT 


CGATTCGTTG 


600 


GGCAATCACT 


TCTGCACTAT 


CTGTTCCACG 


ACCTACCAAG 


CGTTCTTGCA 


ATTCATCCAA 


660 


ATCTGGTGGT 


GTCAGGAAGA 


TAAAGACAGC 


ATCTGGAACC 


TTTTTCTTGA 


CCTGAAGAGC 


720 


ACCCTGAACT 


TCAATTTCAA 


GGAAAACATC 


GATTCCCTTG 


TCCAAGGTTT 


CATTGACATA 


780 


GGTCAGAGGA 


GTTCCATAGT 


AGTTGCCGAC 


ATATTCTGCG 


TATTCCAACA 


TCTGTCCTTG 


840 


/ CGAATC AG 
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(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



GCCAAACAAC 


GCATCCATAA 


TGGTTAAATC 


TTGATAAGGC 


AGAGCTACAT 


TCAAAACAGC 


60 


TTGCTGGTTT 


GTAGGCTTTC 


AATCAGAGCA 


ATCACTTCTT 


CAACCTTGTC 


AGCATCAAGA 


120 


GCAGCTGTCT 


CAATCTTTGT 


ACTTGTTTTG 


CCTTCCAGTT 


TAGCTTTCAA 


GTCATCGCAT 


180 


TTTGACTTGG 


TACGGCTAGC 


AATCATAATC 


TCTGTAAATG 


TTTCGCTATC 


TTGACAAATC 


240 


TTTGAAATAG 


CAACTTGGGC 


AACGCCCCCA 


CAACCAATAA 


CTAGTAAACG 


ACTCATTTTT 


300 


TTCCTTCCTC 


TTCTTCTAAA 


ATGTCCTCAA 


CATACTTGGG 


CAACATAAAG 


GCTCCCACGT 


360 


GTAAGTTTGC 


AGTGTAGTAT 


TCTGTGAAAA 


GCTGGCGTTT 


TTTCCAGCCT 


TCCTTGTCAA 


420 


AATCTTTGAC 


AGGGTGGTAT 


TTTTTCGATG 


CAAATCCAAA 


CAACCAATAG 


CCAGCTGGGC 


480 


TAGTTGGAAT 


ATGGGCCTGA 


TAAACCCGAC 


TGATTGGAAA 


GGCTTGATTG 


ACCTTGCGGT 


540 
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GCATGCTTCG GCAGGCCGAC TCATCCTCGT CAAAGAAAGG A 
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581 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TTCTGAAAAT CCATACTTCT ACCAAGGAGA TACGCTAGCG GTCTCGGAAG TTCTCATACT 60 

TCTATCGCTA TATCTGGCCA TCGCCTTTCT ATCCAAGGAT TGACAACAAG AGATAAGGCA 120 

GAGAAAAAAT TCTTGCTGGA TCAGCTGGTT GCCTGCGATG GTGGTACAGG TGTCATGCAC 180 

GAAAGCTTCC ATGTAGATGA TCCGACCCTC TACTCTCGTG AATGGTTCTC CTGGGCTAAC 240 

ATGATGTTCT GTGAGTTGGT CTTGGATTAC TTGGATATTC GCTAAGGGGC TCGCTTTAGC 300 

TCAACCGATT CTTATCAGAA TCACAAGTTT ACATTTAAAA CGTTAAAATT TAAATTTAGA 3 60 

ATGAGGTTTT ACTTCATGGA AAATGTTGTT GTACATATTA TCTCACATAG CCACTGGGAT 420 

CGTGAGTGGT ACTTGCCTTT TGAAAGCCAT CGTATGCAAT V"3GTGGAATT GTTTGACAAT 480 

CTCTTTGATC TCTTTGAAAA TGACCCTGAG TTCAAGAGTT TCCACTTGGA TGGACAAACT 540 

ATTGTCCTTG ACGACTAACT TACAAATTCG CCCTGAAAAT CGCGACAAGG TCCAACGCTA 600 

CATTGACGAG GGCAAGCTTA AAATTGGTCC CTTTTACATC TTGCAGGATG ATTATTTGAT 660 

CTCCAGTGAA GCCAATGTCC GCAATACCTT GATTGGTCAA CAAGAAGCTG CCAAATGGGG 720 

TAAATCAACC CAGATTGGCT ACTTTCCAGA TACCTTTGGA AATATGGGAC AAGCGCCTCA 780 

AATTCTTCAA AAATCAGGCA TTCACGTGGC GGCCTTTGGT CGTGGTGTGA AGCCGATTGG 840 

ATTTGACAAC CAAGTCCTTG AAGATGAGCA GTTTACGTCT CAGTTTTCAG AAATGTACTG 900 

GCAGGGTGTG GATGGTAGTC GTGTTTTAGG TATTCTCTTT GCCAACTGGT ACAGTAACGG 960 

GAATGAAATT CCAGTTGACA AAGATGAGGC CTTGACCTTC TGGAAACAAA AATTGTCAGA 1020 

TGTGCGTTGC CTACGCTTCG ACCAACCAAT GGTTGATGAT GAAACGGCTG TGAACACCAG 1080 

CCTGTCCCAA AAAAAATCTG AGCGAAGCCA TTCCGTGTGG CAAATGAACT CTTCCCGGAT 1140 

GTAATCTTTG TTCATAGTTC TTTTGATGAA TATGTTCAAG U80 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



CTCAAAAAGT 


CTGAAATTGG 


ATCATTTGGA 


ACCATCTGAG 


CAGTAAGTTT 


AACACCTTTT 


60 


TCTGCCAGTT 


TGTGGAAGTC 


TTCCACATCC 


TTGTCTACTA 


CGTTGATAGA 


ACGTGTAATA 


120 


GAACGAGTTT 


CTGGTGTTTG 


AGACATATTC 


CCAACATTAA 


GGGTTTCAAG 


TGGTACACCT 


180 


GCTTCTACCA 


AACCAAGGGA 


AGCGGTCTGG 


TTTACGAGCC 


ACGATAAAGA 


GACGTTGGCT 


240 


ATCGTATTTG 


CCAGCAAGAA 


TATTGGCTGC 


AGCTTTCTCA 


ACTGGCAAAA 


TACTCAATTT 


300 


CACACCTGGT 


GGTGTCGCAA 


GTTTCAAACC 


ACTCTTTTCA 


ATATCGTTGT 


TGACAACTTC 


360 


GTCGTCTACA 


ACCATAATGC 


GTGAAACATT 


TAGTTTTCCA 


GCCCAAAGAT 


TGGCTACTTG 


420 


TCCGTGGATC 


AAACGTCCAT 


CAATACGGCA 


TCCTACAATT 


GTCATAAGTT 


TTCCCCCTTT 


48C 


ATATGTTTTA 


GTGTAGGTTT 


ACGAGTTAAA 


TGAATCTCTT 


CTTTATATTG 


ACCTTCTGTT 


540 


TCAAAGATAA 


TGATGCGGTT 


GGCACCTTCC 


TTGAGATAG 






579 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



CGCATANCCT 


AAAAGTTTTC 


TTTGACATCT 


TCTGGAATAT 


CTGTATCATT 


AGCTTCCTGA 


60 


ACATCTCTAC 


CATAAATCCC 


AAAGGCTGGA 


ATCCCTTTTT 


GAGCATGTGA 


AGCTAGTACA 


120 


GCTGCAAGAT 


AGACAGCTCC 


TGGGCGTTCT 


GTCCCATTAA 


ATCCCCAAAT 


AGCATGAGGA 


180 


ATATCTGGAG 


ACATATCCAT 


AGTTTCACTA 


CCATAACACC 


AGCATGGTGT 


AACTGTAATT 


240 


GTTGCGCAAA 


CATTTGATTT 


TTTAAACAAC 


TCATGGGAAG 


CTGCAGCCTC 


TGGAACACGA 


300 


CCAATGGTAG 


ATGGAGAAAT 


CACACATTCC 


ACAGGTTCCC 


CATCTGGATA 


TTTCAATGTG 


360 


CTTGAAATCA 


AATCTGCCAC 


ACTTTTAGCC 


ATGTTCATTG 


TTTGTACTTC 


AAGTGATTCG 


420 


CGTACACCTT 


GACGACGACC 


ATCAATAGTC 


GGACGAATCC 


CAATACGTGG 


ATGTTGAATC 


480 


ATACTATTTT 


CCTCTTATTT 


TTCTGTTTCT 


TTCATACTAC 


CATCGCCATT 


ATATTTACGG 


540 


TAGCCTGGGT 


GACGACCTGT 


AACCTTATAA 


TTTTCTCGCA 


TTGAGAATAG 


ACGTTCTAAA 


600 


GTCGGACGAG 


CAATTTCTTG 


CTCCTCAATG 


CCCTTTGTAG 


AAAGTAACAT 


TCTTCCGTGG 


660 


AAGGTTGTCT 


TTGCGACTAA 


TTCTAAAGTT 


TCCATACGGT 


AGTATGCTGT 


AATTACATCG 


720 


CTTCCGACAG 


TCAGAG 
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(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 590 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



CTCATTGTAT TTAGGAGAAA TGATGGTATC TTCCAAATCA AAATCAACTT CACTCCATAG 60 

TCTCAACGGA TTGATTTTCC CATCTTGATA GGTCACATCC TTGTCAAGGA TAAACTGAGT 120 

CAACGCCTCA TGCTGACCTT GACACCTGAT GTCATCTACC AAGAGCCAGA CATCCTCTAC 180 

CAACATGAGG ATTTTTCTCT TGTGAAGATA AGGCAAATCA GGTTCTGCTG ACCAATAAGC 240 

CCCTTCAATA TAATGCATTC CCTCCCTTTC TTTATGGTGA CAAAACAGGG AGTGAGGATA 300 

GTATTCATAT TCCCAGGATC CCGTGATTCT TTCCGGAGCT TTCCCATCTA CAATGCAGGT 360 

CGAATGACTC CAAGCACTCT TTAAGAGATA ACGTTCATAT ATCTCCCGAT AAGAATAACG 420 

CCCAGCATCT ATGAAAATAG GTTGGCCTTG ATACTGTAAG CAAAAACTAT TCTCGTCACT 480 

ATGGCTATGG GCACTTCCTA GCGGACCATT TTTGAAAAAT AGATAACGAT GTTCATCCTT 540 

AATGCAGACA TGTCCAGAGT CTTCAAAGAT CATGGACTTA GGCTGCCAAG 590 

(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTER I STT S: 
<A) LENGTH: 1199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



CTAGTGTAGC TTCCTTAATC TTATCTGATA AGATAGCTGT CATATCAGAC TCAATCATTT 60 
CCTGGAGCAA TCACATTGAC TCGTATATTC CGACTAGCGA CCTCGCGTGC CACAGACTTG 120 
GTAAAGCCAA TCAAGCCAGC CTTAGAAGCA GCATAGTTAG CTTGACCAAT ATTCCCCATC 180 
AAACCAACAA CACTAGACAT ATTAATGATA GCACCTTCTC TGGCTTTCAA TCATCGGTTT 
CCAAAGACTG ATTGTGTCAT ATTAAAGGCA CCAGTCAGAT TGACCTTGAG CACTTTTTCA 
AAATCTGCTT CTGTCATCTT GAGCATAAGA GTATCTTGGG TAATCCCTGC ATTGTTGACC 
AAAACATCTA CTGAACCCAG TTCTGCAATA GCTTGATCAA TCATACGCTT AGCGTCTGCA 
AAATCTGATA CATCTCCTGA AATGGGAACC ACCTTGATAC CATAGTTTGA AAACTCAGCG 
AGCAATTCTT CTGAGATTGC CCCACGACTG TTTAAGACAA TGTTGGCTCC TGCTTGAGCA 
AACTTGTGGG CGATGGCAAG ACCAATTCCA CGACTCGAAC CTGTAATAAA GATATTTTTA 
TGTTCTAGTT TCATTTTTTT CCTTTCAAAA CTTCTACTTA TTTTAGTCTA TTTTTCTAAA 
AGTGCTACTA AACTCGCTTG ATCTTCCACA TGAGCTAAGT GAGCAGTTTG ATCAATTTTT 
TTAACAAAAC CTGACAAGAC TTTCCCCGGT CCAATCTCGA ATAAAGTTGC TTATGCCTGC 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TTCTTGCATG ACCCCAATAC TTTCATAGAA 
GAGCTGAGCA ATGTCCTCTT TTTGCATCAC 
AGTAAAATCT GAAAAACTTA CCTGAGCTAG 
AAGGAGAGCG GTGTGAAAGG GACCTGACAC 
TTCTTGCAAA AGTTCAACCG CTCGATCAAC 
TGCAGGTGTG TTATAGTTGG CTGGAGTAAC 
CTTCTTCAAT GACCTCTACT GGCGTATTGA 



PCT/US97/05306 





TCCTTGACCT 






AGCAUtTTCT 








AGTTTCAGCT 


AGTTTCTGGC 


TAGCAGGCTC 


960 


CTTAAGAGGA 


ATCAAGCGTT 


TGGCACCTGC 


1020 


TGCAACCACT 


TCTCCAGCAA 


TGACGATTTG 


1080 


CACTCCAAGT 


TCCAGAAGCT 


TTTTGACAGG 


1140 


GAACTGCTAC 


CATCTTGCCA 


AGTTCAGCA 


1199 



(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



CTACTATTTT 


CTTTGCCATA 


GCCTTCTCCT 


TTACACACTA 


CGCATATCGT 


GGTAAGAAAC 


60 


ACTGCGTCCC 


ATCTCACCTG 


CATTCTCTTT 


TTGAACAAAG 


GTATTAGCGT 


TTATATAGGC 


120 


AATAGCAGAA 


GCCTTCAACA 


CATCAAAATC 


AAGCCCTGCT 


GCATTAAAGA 


TGGTTTCTGT 


180 


ATCCCTGTTT 


TCAACAGTGA 


CCAACAACCC 


GATCCTGGGC 


ATCGATTCCA 


TCTGTTACCG 


240 


CATTGATAGT 


GTAGGACACC 


AAACGAACAG 


ATTGGTTAAA 


GAACTTATCG 


ATAGCGTTAA 


300 


AGATTGCTTC 


AACGGAACCC 


TGTCCCTGTC 


GCATTAAATT 


CGACTTTCTC 


ACCATCCATA 


360 


TTGGCTAGGC 


TAACGAGCGC 


TTCAATGTCA 


TTATCTGCAT 


GAGTTTGAAG 


TTGTAAATCA 


420 


TCAAAGTGGA 


AGCCTTCTGG 


ATTTTCAACC 


ATGGTTCCAG 


CTACCAAAGC 


TCGAGTATCT 


480 


GCATCTGTGA 


TTTCTTACTT 


CTTATCGGCC 


AGTGCCTTGA 


ACTTAGCAAA 


GAATGGTTTG 


540 


ATATCCTCTT 


CTGTAAAATC 


TAAGGCCAAT 


TCTCTCAGTT 


TCTCAACAAA 


AGCATGGCGA 


600 


CCAGATAATT 


TTCCAAGCGG 


AATCTTAACA 


CCAACCAATT 


CAGGTGTGAT 


GATCTCATAA 


660 


GTGAGAGGAT 


TTTTAAGGAC 


TCCATCTTGG 


TGAATACCAG 


ATTCGTGGGA 


GAAGGTATTG 


720 


CCACCAACGA 


CGGCTTTGTT 


TTTAGGAACT 


GGAATACCAG 


AGAAGCGAGA 


AACCATTTCT 


780 


GACGTATTGA 


TGGTCTCATT 


TAGGACAATA 


CTGGTTTCTA 


CTTGGTAGTA 


ATCTTGGCGA 


840 


ATATTGAGAG 


CCACTGCAAT 


TTCTTCCAAA 


GCAGCATTTT 


CAGCTCGCTC 


CCTAATACCA 


900 


TGGATAGTCT 


CTTCAAAAAG 


TCCTGCACCA 


TTCTTGACAG 


CAGCAAGGCT 


ATTTGCCACT 


960 


GCCATCCGAA 


GTTCATCATG 


ACAGGGAGGC 


GAATAGATGA 


TCTGACGATC 


CGTCTGGACA 


1020 


TTCTCAATCA 


GGTATTGGAA 


GATGGCACCA 


CATTCCTCTG 


GTGTGGTAAA 


TCCTATATTT 


1080 


TCTGAAAATT 


TCTTCAGTAA 


AGAATATTTA 


GCTAATTGAA 


AGTTCATGAA 


AATTATTAAA 


1140 


ATATTTCATT 


TTTTAGAGGT 


TAAGTTCCAA 


CTTTTTTCTA 


TCAATTCCAG 


TACTTNTTCA 


1200 


TCTGATAAAG 


TATCATCAAG 


GGACACACTA 


ATCCAGTAGC 


GCTTGCTCAT 


ATGGAAGGCT 


1260 


GGATAAATCC 


CCTTTTGTGA 


AAGCAAATTA 


GCTACTTGGT 


CATGCTTGAG 


GTTGACTGCT 


1320 


TCCACTTGTC 


CTTCTCTGCC 


CTTTTCCAGC 


TTATTCCAAG 


AGATTTTCAN 


CAAGACGGCA 


1380 


TACCACTTTT 


GATTGCCTTC 


ATGGCGCAAT 


ACAG 






1414 
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(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



CTCCCCATTT TGGAAAATTT CTGTCAAGAA ACGGCGAACC AGCTTTTTAT TTTCTGCTTT 60 
CTTGTCCAAA TCCTTGATTT CAAAATCTCC AAAAATTTGA TCTAGTTGGT CATTTTCAGG 120 
TGTTCGATAG TAGTCAATGA CATCCCAATG CTCAACAATA CGACCATTCT CATCCGCACG - 180 
GAAAGTATCC GTCGTCACCC ATTGAGCTTC TCCACCATTC AGATATTGAT GGAACATGAA 240 
CAAAGACCAG ATTGCCATCC TCAATGGTGC GGACAATCTT AATCTGACGC TCTGGATCAC 300 
GCTCAAAGAA ATCTGCAAAG AAGGCTGCAA ATCCTTCTTT CCCGTCAGGA ACACCTGTCG 360 
AATGTTGGAT ATAGGTATCC CCTACAGACT GGGCTTGAGC CTCAGCAACT CGTCCGTCTT 420 
GAATGGCATG GATGTATAGG TTGTGAGCAT TTTTCACTTG TTGTGACATA TTCTAAACCT 480 
CATTTCCCTT CTCTTTCAG' TTCGCCAAAA TTCTTTCTTG AAAACCTTCA AATTGGTGAA 
TTTCTTCCTC TGAAAATCcT TTGTAAAAGA TAGTATCCAA TTTCTGACTG ACACGATGCC 
CCACTTCTTT CTGGGACTTG CCTAACTCCG TTAAAACTAA ATACTTCTTA CGCTTGTCTT 
TTCCACACGG ACTAACAATT ACAAGCTTTT GTTCCTCTAG CTTTTTTATC ATAGTCGTCA 
GCGTATTATT CGCAAGCCCA GTCGCAAGCG 



540 
600 
660 
720 
750 



(2) INFORMATION FOR SEQ ID NO: 81: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 
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GTATCGTTCA ATTCCAAAGA TACGGAGGAA GCTGCGTTTC GCGAGAAAAC TCTTGTTAGA t>u 

CCAGATAGGG ATTAGCCAAA ATCCTTCTAT GNGCTTATAT CCAAATGGAG ATGCAAACTT 120 

CTGCCCAACT CTTGACCAAT AAGATCTTTC TTAAAAATCC ACTCAAGGCT CTGGTAGAGG 180 

AAAAATATGG GATTGAGTAT GAAGAATTTA CCAATCCTTG GCACGCTGCC ATTTCTAGCT 240 

TCGTTGCCTT TTTCCTTAGA AGTTTGCCTC CAATGCTGTC AGTGACCATA TTCCCAAGTG 300 
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AATATCGCAT CCCTGCTACT GTCCTTATTG TCGGTGTGGC 
CTAGTGCCAG ACTTGGAAAG GATCCGACTA GAACAGCTAT 
GTCTCTTGAC CATGGGAGTT ACCTTCCTGC TCGAACAACT 
GAAATACCTC GATTTTGAAG TCGAGGTATC TTTTTTACAT 
TTCTTGAAGT AATCATGAAA ATCAGCACAT AGGCGATGAG 
AGGTCACAAT CAACATCATA GTCGTATCCA GTACACCAAT 
GCATATGGTA GGCAAAGGCG AGATGTATGA AGGCAAAGAG 
TTAAAACCTG TTTGTTGATG GTTTGCTTGA TTTGCTTTTG 
AGATAATAAA GCGTTCACGG TCTTCGTAGC CTTCAGAAAT 
GAACAGTTCC GACCATAAAG ATAATGGATA GGAAAATACC 
AGAGGACACT CATTTGAGCA CTAGCATCTG CTAGATTGCT 

<2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



CTGATTGAAC 


TTCTATTTAC 


TAATATTCAA 


AAATCCTCCG 


TTTCAAAGAG 


CAGGGAACTC 


60 


TTTGTGACAG 


AGGATTTTTT 


CTATAGGGCT 


TTAGCAGCTG 


CAATTGCGGC 


TTCGAAGTTT 


120 


GGCTCAGAAT 


TGATATTATC 


CACGTATTCA 


ACGTAGCGAA 


TCGTATTGTC 


AGTATCGAGG 


180 


ACAAAGACTG 


CGCGTGCTAA 


TAGGTGCCAT 


TCGTTGATCA 


AGAGGGCATA 


ATCGCGCCCG 


240 


AAAGAATGGT 


CAAAGTAGTC 


TGAAAGCATA 


ATGGCATTGT 


CAAGGCCTTC 


AGCACCGCAC 


300 


CAACGTTTTT 


GAGCAAAAGG 


TAGGTCCATT 


GAAACAGTCA 


ATACGACCGT 


GTTGTCCAGT 


360 


CCAGCCAATT 


CTTCATTAAA 


ACGACGTGTT 


TGAGTTGAGC 


AGATGCCTGT 


ATCGATAGAA 


420 


GGAACGACAC 


TCAAGACTTT 


TTTCTTGCCA 


TCAAAATCAG 


CCAGAGATTT 


TTTAGAAAGA 


480 


TCTGTTGTAG 


TAAGAGAAAA 


ATCAAGCGCC 


TTGTCGCCGA 


CTTGTAGTTG 


TTTACCTGTA 


540 


AAGCTCACAG 


GATTTCCGAG 


AAAAGTTACC 


ATAGGATACT 


CCAATCTTTT 


TTCTTCCATT 


600 


GTATCTGAAA 


CAGTCAGAAT 


TTTCCAATGA 


TTTGACCGGA 


AATGTGGGCA 


TAGAAAAAAC 


660 


GCCAG 












665 



(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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CCTTCTTCTC ACTGGTTACA 360 

GATTCGGAAC CTTGCTATTG 420 

TTTCAGCATT TAGAATACAA 480 

TTGCACAATC TTGCGATAAC 540 

GAAGATAGCG CAGATAGACA 600 

CACTTTTAAA ATCAGGCTAA 660 

CAAAGGAAGG AAGAAAACAG 720 

GTCCAAACCG ACTTTCTGCA 780 

TTGTTTGTAG TAGATGACCA 840 

GATAAAGAAG ACACCGCCAA 900 

ACCATAAACA TAG 953 



WO 97/37026 PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



l« 1 rLrlo 1AX An 


21 A & 21 PTTP pr* 
AnnAL X X uLt 


GAAAGTATCT 


CTGAAGTTAA TTTGAATGTC 


TTTTTTCAAA 


60 


A 1 111 Awii 


XXX 1\j X iuAA 


ATCTTTAATC 


AAGAGCCTTG TCTGATAAAT 


TTTTTTGAGT 


120 


i IniAjnnl XV 


T2nV"T"P2ir , 2i 21 


GGCGATATTC 


CAAAGTTAAC AATAAACCAA 


GAATAATAAG 


180 




TATGAAAGAT 


ACGATTTCTA 


ATAAGGATTT GATTTCCATG 


GGCTATAGAC 


240 


CATCAACAGC 


AAATGCTATT 


ATCCATCAGG TGAGAGAATT ACTTGTATCA CGAGGCTATA 


300 


CATTTTATAA 


TCGCAAACGT 


TTGATGGTTG 


TTCCAAAAAG TGTTGTGAAA 


GAGTTGTTGG 


360 


GAATGGAATT 


GTGAAATGGC 


TTATATCGAG 


TATAAACAGC GTGGGAAGAA AAGGCTTTGG 


420 


TCGTTTTCTA 


TACGTGAGAG 


GAGCAAGAGC CTACTCCATA AAAGCGGATT 


TAAAACAAAA 


480 


CGAGAAGCTA 


AAATAGAAGC 


GGAGAAAGTT 


CTTCATAAGT TAAATACTGG 


GAGTGTCTTA 


540 


AGCTCTAGTA 


TGACTTTATC 


GGAGCTTTAT 


AATGAATGGC TGGATTTAAA 


AATTTTACCT 


600 


AGCAATAGAA 


GTGTAGTTAC 


TAAAAAAAAA 


TATCTTATGA GAAAGAAGGT 


CATCGAAAGG 


660 


TTATTTGGAA 


ATAAGCCTGT 


ATCACAAATT 


AAGCCTAGCG AGTATCAAAA 


AATTATGAAT 


720 


GAGTATGGAG 


AGACTGTATC 


GAGAAATTTC 


TTAGGAAGAT TGAATTCTAG 


TATCCAGGCA 


780 


AGTATACAGA 


TGGCTATTGC 


TGATAAGGTG ATAATAGAGG ATTTTACTGC 


TTATGTTGAG 


840 


TTGTTCTCTT 


CTAAGAGTGG 


ACACAAAGGT 


TGAGGAAAAG TATCTACATA 


CTGAGTCAGA 


900 


TTATCAAAAA 


GTTTTAGTAT 


ATTTGAAAAA 


TAAGTTTGAT TATCAGAAAT 


CTATTGTACC 


960 


GTATGT YTT 


TATTTCCTTT 


TTAAAACTGG 


CATGCGTTTC TCCGAATTGA 


TTGCTCTAAC 


1020 


TTGGGATGAA 


GTTGACGAG 








1039 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CTGAGAAGTC AGGAAGGCGA TGTGGAGAGA TTGAGGAATT GGATACTTAA CTCCATAGTA 60 

ATCTTCGTAA AACTCGATAG AGCGAACAGC GATATCCAGT GAGAAATCAA GATTTGAAAG 120 

TGGATGTGCT TTGGTTGAGT AGACACCCTA CCAGGGTACC ATTTTTAGTT TTAGCGGTCA 180 

CCCCTTGCAA ATCACCAGCA ACAAAGGCCA ACAAGTAAGA AGACATGCGA GGTGTTGTCT 240 

CAAACTTCCA GATACCTGTT TCCTTACGGT TTTCAACATC GATTTCTGGC ATGTTTGACA 300 

AGGCCAATTC ACCTTCTGCT TGGTCAAAGC GAAGAGAGAG GTCAAAAGTT GCTTTGGCTT 360 

CAGGCTCATC CACACATGGG AAAGCTTCGC GCGCAAAATG GCTCTCGAAC TGAGTAGACA 420 

AGACCTCCTT CTTGACTCCA TCAACTGTAT AATAAGAAGG GTAAATCCCT GTCATGTTGT 480 
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CTGTAATTTT ACCAGAAAAG GCAAGAACCA ATTCAACTTG ACCAGCCTCA GCCAATTCGA 
TATGAAGGGC TTCATTGTCA TGGTCAACTG TAAATGGACG AGCTTGACCT GCAACTTCTA 
CAGAGGTGAT TTCCAAGTCT TTTTGGTGGA GGGAGATGCG GTCACTCTGT GCTTGACCAG 
TGATGGTCAC TTTCCCAGAA AAAGTCTTGG TCTCACGACT CAAATCTAAA AATAAATCAT 
AATGTTCAGG AACAAATTGC TTAATAAAAT GTTCAACTGC TTGCATAGTT TTCTCCTATT 
CTAAGTTTAA GAG 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



.CGGATGATTT 


GCTTTTCATC 


TATGATAAAG 


TCTTTGTTGA 


GCGTCGGAAT 


CTCTACTGAC 


60 


TGGAAATTTC 


CCGTAGATAA 


TCCAAATGCC 


CTTTAAAGAA 


AACCTCATCT 


GTCAACACCG 


120 


AAATCATCAC 


TGCTCCGTTT 


TCTTCATAAG 


TCTGGGCCTG 


TTGCACAATA 


1 CCACATCGA 


180 


GATTGATATC 


TCCCCAAAAC 


TAGGGCTAGC 


TTTCTTGACT 


TCAGCGATTA 


CCTGCAAGCG 


240 


GTCCTGATGA 


TTCTTCAAAA 


ATTCTGCCAA 


GCGATAGGTC 


TGGCGCAGAG 


GCTGGATTTG 


300 


CTCCAGCTTC 


ATCTGCTCCA 


CCTCACGCGC 


CTTCTGCTCT 


AAGATTCGTG 


CTAAAAATTC 


360 


CTGACTCATT 


TTTGGTACTC 


CTGTAACAGT 


CTGAGTTTTT 


CAAGGGCCTT 


GCCTCTAGCA 


420 


ATCACTTGAC 


GGGCCAAGGC 


AACTCCTTCC 


TTGATGCTAT 


CAATCTTACC 


ATTAGCATAG 


480 


AAACCAAGAC 


CAGCATTCAA 


GACTGTCGTT 


TCCAAGAATG 


GACTTGCTTC 


GTTTTTCAGA 


540 


ACGCTAAGCA 


AAATTTCTGC 


ATTTTCCTGA 


GCATTCCCAC 


CACGAATATC 


TTCCATAGCA 


600 


TAGCCTTCCA 


TTCCCAAATC 


CTCTGGAGTA 


AAGCTTGACA 


AG 




642 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CTCTTGCAGG TTATTAAGGA GAAAACGGAG GTAATAGAAG TATGATTTAT ACAGTCACAC 60 
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TCAATCCATC CATTGACTAT ATCGTTCGTT TGGACCAAGT CAAAGTTGGT AGTGTAAATC 120 

GTATGGACAG TGATGATAAG TTTGCTGGTG GGAAAGGAAT CAATGTCAGC CGTGTCTTGA 180 

AACGTTTGAA TATATCAAAT ACAGCGACGG GATTTATCGG TGGCTTTACT GGTAAATTTA 240 

NCNCAGATAC TTTAGCAGAG GAAGAAATCG AGACNCGTTT TGTCCAGGTG GCAGAAGATA 300 

CTCGTATCAA TGTTAAAATC NAAGCAGACC AAGAAACAGA AATCAACGGA ACGGGTCCAA 360 

CTGTTGAACC GGTTAAGCTA GAAGAATTGA AAGCTATTTT ATCTAGTCTG ACAGCAGAAG 420 

ATACAGTTGT CTTTGCAGGT TCAAGTGCTA AAAATCTAGG CAATGTTATC TATAAGGGAT 480 

TTAATCTCCT TGACGCGCCA GACTGGTGCG CAAGTGGTCT GTGACTTTGA AGGACAGACC 540 

TTAATTGATA GTTTGGATTA CCACCCTCTT CTTGTAAAAC CAAATAATCA TGAACTTGGA 600 

GCGATTTTTG GGGTTAAACT CGAAAGTTTA GATGAAATTG AGAAATACGC TCGTGAGTTA 660 

CTGGCTAAGG GTGCTCAAAA TGTTATTATC TCTATGGCTG GTGATGGTGC CCTTCTTGTC 720 
ACATCTGAGG GAG 



733 



(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CTAAGGGAAG TAGGAAAAGT ATGTATCCAG ATGATAGTTT GACATTGCAA CACGGACTTG 

TACCAGATCA ACATGATGCA GGTTTACTTT GATCAAGGGA TTTACAATAA GAAGGCGGTC 120 

TTTGAGGTGT ATTTCCGCCA ACAGCCTTTT AAGAACGGCT ATGCGGTTTT TGCAGGTTTG 180 

GAAAGAATTG TGAACTATCT TGAAGACTTG CGTTTTTCAG ATAGTGATAT AGCCTATTTG 240 

GAGTCGCTTG GTTATCATGG GGCGTTCTTG GATTACCTTC GCAATTTCAA GTTGGAGTTG 300 

ACCGTTCGTT CTGCCCAAGA AGGGGATTTG GTTTTTGCTA ATGAACCGAT TGTCCAGGTG 360 

GAAGGACCTC TAGCCCAATG TCAGTTGGTC GAAACGGCTC TTTTGAACAT CGTCAACTAC 420 

CAGACCTTGG TGGCGACGAA GGCAGCCCCC TATCCGTTTG GTTATCGAAA ATGAACCCTT 480 

GATGGAGTTT GGGACACGTC GGGCTCAAGA AATGGATGCG GCCATCTGGG AACACG 536 

(2) INFORMATION FOR SEQ ID NO: 88: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



CTGCCCCTGT 


AAGGCTGGAC 


GATTGCCTTT 


CTTAGTATCC 


GCAAAGAGGT 


AAACTGAGAA 


60 


TAGAGAGGAT 


TTCTCCTTCA 


ATATCTTTGA 


CAGACAGGTT 


CATCTTGCCT 


TCTACGTCTG 


120 


AAAAAATCCG 


CATATTGACC 


AGTTTTCTCA 


CAGCATAGTC 


CAAATCTTCC 


TCTTGGTCCT 


180 


CTGGTCCAAC 


ACCAACCAGC 


AATAAAAGTC 


CCTGATTGAT 


TTTTCCCTGA 


ATCTGGCCTT 


240 


CTATACTCAC 


TTGGGCTTTT 


TTAACCCGTT 


GGATAATGAT 


TTTCATAATA 


GCCTTTCTAG 


300 


TAAGAGCTAG 


GACAACTAGC 


CGTTGGTCCG 


TTTGACAGAG 


TAAACTTCTG 


GCACACTCTT 


360 


AATTTTATCG 


ACAACCGTGG 


TCAGTGTAGA 


GAGGTTGGCA 


ATACCGAAGG 


ACACATGGAT 


420 


ATTAGCAAAC 


TTCATATCCT 


TGGTTGGTTG 


GGCATTGACC 


GTTGAAATAT 


TCTTGGTTGT 


480 


ATTTGAAAGA 


ACTTGCAGTA 


CATCGTTCAA 


CAGTCCTGTA 


CGGTTGAGAC 


CGTAGATATC 


540 


GATATGGGCC 


ATATACTCCT 


TATTTGAG 








568 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



CTTTATATTG 


TACCAAGTAT 


CATTTCTAGT 


GCTGCTATGG 


GCTTGATTTT 


TCTTCAAATC 


60 


TATAATCCAA 


ACTATGGTGT 


TGTTAACCAA 


ATTATTCATC 


TATTTAATCC 


ATCGTTTAAA 


120 


GATTCAGTAC 


TGTTGACTCC 


CAGGATTAAA 


AATAGTAGCT 


ATGACTGGCG 


CTTATATCTT 


180 


CTTTGCAGGA 


GCATCAACCA 


TTATGATTTC 


TTGGGCAAAT 


TTTTGCTATT 


CCAGAAGAAG 


240 


TTCAAGAAGC 


TGCTATTTTA 


GACAATATTA 


CTGGTTGGAG 


GAAAGAGTGG 


TATATTACGA 


300 


TTCCGATGAT 


TAAGGGGACA 


ATTAAAACTG 


TTTCAATTAT 


GGCAGCAACT 


TCAGGATTTT 


360 


TGCTCTATAA 


CGAAGTATTC 


TTTTTGACAA 


ATGGTGCTGC 


AGGAACAAAA 


AGTATCAGTT 


420 


TTGTTATTCG 


AGAATTAGCA 


GTGGCTAGCT 


CACGAACTCA 


GTATGCTCGT 


GCAAATACAA 


480 


TTGGAGTTAT 


ACAAATCTTA 


GGTGGAATGT 


TGATTATCGT 


TTGTATTAAT 


ATTTTATTCA 


540 


GAGAAAGAAA 


AAGACTGAAA 


GGTGGGAAAT 


GATTATGAAT 


ACACATATAA 


ATGGTATTAG 


600 


TAAAAAAGGC 


AAAGTTCTTA 


TATATGGTTA 


TATGCTCCTT 


ACCATTTTAA 


TTTCTATTTT 


660 


CCCTATTGCG 


TGGATTTTTT 


TATCATCATT 


AAAAGCAGAT 


CCTATGAAAA 


ATCCAGGTAT 


720 


TAGTTTACCG 


ACTGACTTTA 


CTCTTGAAGG 


TTATATAAAT 


GTTTTTACAA 


AACTTCATGT 


780 


TTTTACTTAC 


TTTTGGAATA 










800 



(2) INFORMATION FOR SEQ ID NO: 90: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: €86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



CTCAGTATTA 


TCCCTACCAA 


AATGAACGTC 


AGTTAGTGAC 


TCAGGTGGTT 


TGGGAACAAT 


60 


GGGCTTTGGA 


ATTCCAGCAG 


CAATCGGTGC 


TAAAATTGCT 


AACCCAGATA 


AGGAAGTAGT 


120 


CTTGTTTGTT 


GGGGATGGTG 


GTTTCCAAAT 


GACCAACCAG 


GAGTTGGCTA 


TTTTGAATAT 


180 


TTACAAGGTG 


CCAATCAAGG 


TGGTTATGCT 


GAACAATCAT 


TCACTTGGAA 


TGGTTCGCCA 


240 


GTGGCAGGAA 


TCCTTCTATG 


AAGGCAGAAC 


ATCAGAGTCG 


GTCTTTGATA 


CCCTTCCTGA 


300 


TTTCCAATTG 


ATGGCGCAGG 


CTTATGGTAT 


TAAAAACTAT 


AAGTTTGACA 


ATCCTGAGAC 


360 


CTTGGCTCAA 


GACCTTGAAG 


CTACTACTGA 


GGATGTTCCT 


ATGCTAATTG 


AGGTAGATAT 


420 


TTCTCGTAAG 


GAACAGGTGT 


TACCAATGGT 


ACCGGCTGGT 


AAGAGTAATC 


ATGAGATGTT 


480 


GGGGGTGAAG 


TTCCATGCGT 


AGAATGTTAA 


CAGCAAAACT ACAAAATCGT 


TCAGGAGTAC 


540 


TCAATCGCTT 


TACAGGTGTC 


CTATCTCGTC 


GTCAGGTTAA 


TATTGAAAGC 


ATCTCTGTTG 


600 


GAGCAACAGA 


AGATCCGAAT 


GTATCGCGTA 


TCACTATTAT 


CATTGATGTT 


GCTTCTCATG 


660 


ATGAAGTGGA 


GCAAATCATC 


AAACAG 








686 



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



TTTTCCCAGC TATTTTACTG AGTATAAAAG CCTTATTTAA CTTACTCTAT GTACTCGGTT 60 

TTCTAGGAGG AATGTTGGGA GTTGGGATTG CTTTGGGGTA CGGAGTGGCC TTATTTGACA 120 

AGGTTCGGGT GCCTCAAACA GAAGAATTGG TGAATCAGGT CAAGGACATC TCTTCTATTT 180 

CACAGATTAC CTATTCGGAC GGGACGGTGA TTGCTTCCAT AGAGAGTGAT TTGTTGCGCA 240 

CTTCTATCTC ATCTGAGCAA ATTTCGGAAA ATCTGAAGAA GGCTATCATT GCGACAGAAG 300 

ATGAACACTT TAAAGAACAT AAGGGTGTAC TACCCAAGGC GGTGATTCGT GCGACCTTGG 360 

GGAAATTTGT AGGTTTGGGT TCCTCTAGTG GGGGTTCAAC CTTGACCCAG CAACTAATTA 420 

AACAGCAGGT GGTTGGGGAT GCGCCGACCT TGGCTCGTAA GGCGGCAGAA ATTGTGGATG 480 

CTCTTGCCTT GGAACGCGCC ATGAATAAAG ATGAGATTTT AACGACCTAT CTCAATGTGG 540 

CTCCCTTTGG ATCGAAATAA TAAGGGACAG AATATTGCAG GGGCTCGGCA AGCAGCTGAG 600 
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GGAATTTTCG GTGTAGATGC CAGTCAGTTG ACTGTTCCTC AAGCAGCATT TTTAGCAGGA 660 

CTTCCACAGA GTCCCATTAC TTACTCTCCT TATGAAAATA CTGGGGAATT GAAGAATTGA 720 

TGAAGACCTA GAAATTGGCT TAAGACGGGC TAAGGCATTC TTTACAGTAT GTATCCTACA 780 

GGTGCATTAA GCAAAGACAA TT 802 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



CTACTATTTT 


AGCATAAAAA 


TGCCCAAAGG 


GGGNGCCGTG 


TGTTTACTGA 


TTTTCAGGNT 


60 


AATGGACCAG 


GNAAATCAGC 


ATGAAAATAA 


AAAGAGAAAC 


AGATTATTTT 


AGCCATTTGT 


120 


CAGATTTATG 


CTATGCTTAA 


GGTAGAAAAT 


GAAAGGGATA 


ACAAATGTAT 


TTAGGAGATT 


180 


TGATGGAGAA 


AGCCGAGTGT 


GGTCAATTTT 


CAATCCTTTC 


CTTTCTATTA 


CAAGAGTCTC 


240 


AGACGACCGT 


CAAGGCTGTA 


ATGGAAG/, A 


CAGGATTTTC 


AAAAGCAACC 


CTAACCAAAT 


300 


ATGTCACCCT 


GCTCAATGAC 


AAGGCTTTGG 


ATAGTGGCTT 


AGAACTGACT 


ATTCACTCAG 


360 


AAGATGAAAA 


TCTGCGTCTG 


TCTATAGGTG 


CAGCTACTAA 


GGGGAGAGAT 


ATTCCGGAGC 


420 


TTGTTTTTTG 


GATAGTGCTG 


TTAAATACCA 


GATTTTGGTT 


TATCTTCTCT 


ACCACCAACA 


480 


GTTTTTAGCC 


CATCAGCTGG 


CTCAAGAATT 


GGTGATTAGC 


GAGGCTACGC 


TTGGTCGTCA 


540 


CTTAG 












545 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

GAGGTCAAGA TTGATTCCCA AGGTTNGAGG AATCTGCGGA ACCAGATTTA GAAGAAATCG 60 

AGCGGTTACC ATGCTTGGCC ATGTGAATAC CGCCACCAGC CAAGACAAAG GCTGCAGTTG 120 

TGGAAATATT AAAACTGAAA GACTTGTCCC CACCTGTACC ACAGTTGTCC ATAGCATCAT 180 

GAATCTCAGT TGGAATATGC TGGGCATGTC CTCTCATGAC TTGGGCAATG GCTGTGCGTT 240 
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CTTCAGGTGT TTCCCCCTTC ATCTTAAGAG CTAAGAGGAG AGAAGCAATC TGCGCTTCAG 300 

TTACACGCCC AGTTACGATA CGCTCAATGA CATCCGTCAT TTCCACACCT GATAAATTTT 360 

CAAATTTTGC TAGTTTTTCA ATAATCTCTT TCATCCTAGT TTCCTCACTT TACAACCTCC 420 

TCGATAAAAT TCCGAATAGA AGACAAGCCG TCTGGCGTTC CAATGCTCTC TGGATGGTAC 480 

TGGAAGCCAT AAATCGGTAG GTTTTTATGT TGAATCCCCA TGATGGCTTG GTCATCAGTC 540 
GAACGAGATC AAGCTTATCG ATACCGTNGA CCTCGA 



(2) INFORMATION FOR SEQ ID NO: 94: 



576 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



60 
120 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATTAAAGGGA GGCGAACATG GCCCAAGATA TAAAAAATGG AGAAGTAGAA GAAGTTCAAG 
AAGAGGAAGT TGTGGAAACA GCTGAAGAAA CAACTCCTGA AAAGTCTGAG TTGGACTTGG 
CAAATGAACG TGCAGATGAG TTCGAAAACA AATATTCTTC GCGCTCATGC AGAAATGCAA 180 
AATATCCAAC GCCGTGCCAA TGAAGAACGT CAAAACTTGC AACGTTATCG TAGCCAGGAC 240 
TTGGCAAAAG CAATCTTACC ATCTCTTGAC AACCTTGAGC GTGCACTTGC AGTTGAAGGT 300 
TTGACAGATG ATGTGAAGAA GGGCTTGGGG ATGGTGCAAG AAAGCTTGAT TCACGCTTTG 
AAAGAAGAAG GAATTGAAGA AATCGCAGCA GATGGCGAAT TTGACCATAA CTACCATATG 
GCCATCCAAA CTCTCCCAGC AGACGATGAC CACCCAGTAG ATACCATCGC CCAAGTCTTT 
CAAAAAGGCT ACAAACTCCA TGACCGCATC CTACGCCCAG CAATGGTAGT GGTGTATAAC 
TAAGATACAA AGCCCGTAAA AAGCTCGCAG TAAAAATAGG AGATTGACGA AGTGTTCGAT 
GAACACAAGA AAATCTANCT TTTTTACTCA GAGCTTAGGG CGTGTTCGAT TCGGCAATTC 
TGACGGTAG 



360 
420 
480 
540 
600 
660 
669 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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CTAGTTCACC 


ATTTCTATTT 


AAATTCGGCT 


CAACTACACC 


GAAATATCAT 


ATAAGGTAAA 


60 


GTTCCCACTA 


AGAATCTCAG 


AATGAAATTC 


TTCACTAACT 


TCAGCGGTCA 


AACTTAGCTT 


120 


ATCCAATTTC 


AGACCGATAA 


ACCCATACTT 


CAGAGATACT 


TCCTCTTGAA 


ATTTAGAGTA 


180 


ACTCTTATTA 


GACATTTGAT 


GTATCGTTAT 


CGGGCTATTA 


GAAGGAGCCC 


GATTTAAAAC 


240 


TTCTTTATTC 


ATGAAATTTC 


TCCTTCAAAA 


AACAAAACAT 


TCTATATACT 


TAAATTTTAG 


300 


GGCAAGACTG 


CCGTTCCACT 


GACCACGCTC 


CACTTGCCAG 


CAAAGCTGGC 


AGGTCCAGCC 


360 


TTACCCTAAA 


TATAAATTAT 


TTAATTTTTG 


CCATTTCAAT 


CAATTCTGCA AAGGTCCCAA 


420 


GCTTTCTGGA 


TTGAATTACC 


TCATATTGGG 


CATTCTGTCC 


AAATGAAGCA 


TTATTAATCA 


480 


CTTCAACCTC 


ATAAAGGCCT 


GGAGCAGAGA 


AAACAGCTCC 


AAATTTGGAT 


TTATCGAAAT 


540 


GCATACCTAG 


GTCACTCCAA 


CAATCATTAA 


GGGTGGAGAC 


TTTTATCCAG 


CCTCGTTCTT 


600 


TCTTAGTAGA 


AATAG 










615 



(2) INFORMATION FOR SEQ ID NO: 96: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



CTACTGTAGT 


TGGAACTGTT 


GAAAAACAAG 


GAAAACAAAA 


GAAAGTTGTT 


ACTTACAAGT 


60 


ACAAACCTAA 


AAAAGGTAGC 


CACCGTAAAC 


AAGGTCACCG 


TCAACCATAT 


ACAAAAGTTG 


120 


TCATCAACGC 


AATTAACGCT 


TAATTTTAAG 


GAGAACACAT 


GATACAAGCA 


GTCTTTGAGA 


180 


GAGCCGAAGA 


TGGCGAGCTG 


AGGAGTGCGG 


AAATTACTGG 


ACACGCCGAG 


AGTGGCGAAT 


240 


ACGGCTTAGA 


TGTCGTGTGT 


GCATCGGTTT 


CTACGCTTGC 


CATTAACTTT 


ATCAATTCTA 


300 


TTGAGAAATT 


TGCAGGCTAT 


GAACCAATCC 


TAGAATTAAA 


CGAAGATGAA 


GGTGGCTATC 


360 


TGATGGTTGA 


AATACCAAAA 


GATCTTCCTT 


CACACCAGAG 


AGAAATGACC 


CAGTTATTCT 


420 


TTGAATCATT 


TTTCTTAGGT 


ATGGCAAACT 


TATCGGAGAA 


CTCTTCTGAG 


TTCGTCCAAA 


480 


CCAGAGTTAT 


CACAGAAAAC 


TAACACGGAG 


GAAAACATTA 


TGTTAAAAAT 


GACTCTTAAC 


540 


AACTTGCAAC 


TTTTCGCCCA 


CAAAAAAGGT 


GGAGGTTCTA 


CATCAAACGG 


ACGTGATTCA 


600 


CAAGCGAAAC 


GTCTTGGAGC 


TAAAGCAGCT 


GACGGACAAA 


CTGTAACAGG 


TGGATCAATC 


660 


CTTTACCGTC 


AACGTGGTAC 


ACACATCTAT 


CCAGGTGTAA 


ACGTTGGTCG 


TGGTGGAGAC 


720 


GATACCTTGT 


TCGCTAAAGT 


TGAAGGCGTA 


GTACGCTTTG 


AACGTAAAGG 


ACG 


773 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 97/37026 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CTGGATAATT GGAGCAATTC CTGATGCCAG TTTGAGGTTA CTGATAGGAT TGTGGGCGAT 
AGCCACTTGA GAAGATGCCA AGAGTTCAAT TTCTCTCTCG TTTAATTCGA CCCCGTGAGC 
AAATACGGAC GGATGATCTA AATAACCCAG TTCTTCAAGA AAAGCAAGGG GGC™ CC 
GTACCGTTTG AGGATAATTC CTGACTCCTC CTTGGTCTCC GCCACATGGA CATGGAGCGG 
AATATTTAGC TCTTTTGCCA TTTCCAAACT CGCrPCCAGC AAGTCTCTAC £££££ 
CGGAGAATGA GGTGCTACCA TAACCCTTGA AATTTGGGAT TTTTATATTT TAAGATTTCG 
TCTATGATGG ATCGAGTTCT GCTTATAGTC TCAGCAGTTG TTTCTGTCTC TGAAGAAAAG 
AGAGTCGGAG AAAAATAACA ACGCATCTTG GAAGTTTTCA CCACCTGATA 
ATATCCACAC CATTGGGATT ATACATATCG TTAAAGGTTG TTGTTCCTGA ^C^ 
TCTGTTAGGG CTTCTTTGAC CGCATTGGTA GTCATGTCGG GAGTAAACTC ZZc^T 
GGCCAGATAT AGTCATTGAG CCATTTCATG GAGATTGCTG TCATCTCGGA TCCCTCTCAG 
ACCTGTCATT GCAGAATGGG TGTGACAATT GACCAAACCA GGCATAAGCC ^ 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CTAGAAAATC TGCTTTTTTC TCGAAAAGTT TAATACTTGG TACAATATTA CGCTTGPTTT 
GCTCGGC^C TCCAAGGGCA ATTTTTTCTA GTTTTTCAAC ££££ 
CATCCCACTT GGCAACTGAC CAGTCTGCAG GAAAGGCCCA GATTTGGCTA Z^Z 
CGGTTACTTT TTGAGCGATG AACTCCAGCT TGTCTCCC^ C^ZZ ^ 
~AC ™ CC ACATTGTCAT „S 
TTTCCATATC CAGCACGCGC GCCAAGCGCT TTAmr-n^ » m AACTGACGAT 
=™, ATAACCTCAA ££££ 

CTCCOTITi CM *™™ ^C^C 

ACACCAGAGA TATCCTTCGT TTOTTJAM ACACAGGTAT TCCATTCCCC TT^r^ 

£S S£E "™" "°" e ** Wi= 

~ " ~ = S = 
CCACAC^T AAGCAAATCT CCAGCCOCTA CA^A.T 

117 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
713 



60 
120 
180 
240 
300 
360 
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460 
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778 
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(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: ! 


99: 






CTTCCCAGAT 


TTTTTCGTTG 


ACTTGCTGTT 


CAATCGCACG 


CAATTCTTCA 


GCAGTTACAG 


60 


CTTGGAAGTG 


GGTAAAGTCA 


AAGCGAAGGA 


ATTCAACTTC 


GTTAAGAGAT 


CCTGCCTGTG 


120 


TTGCGTGGTT 


TCCAAGGATA 


TTGTGAAGGG 


CAGCGTGAAG 


CAAATGAGTC 


GCAGTGTGGT 


180 


TTTTCATGAC 


ACGGTGACGG 


CGATTGCTAT 


CAATTGCCAA 


GGTATATTCT 


TGGTTCAAGG 


240 


CAAGCGGTGC 


AAGGACTTCA 


ACTGTATGAA 


GGGCTTGACC 


ATTTGGGGCT 


TTCTGAACAT 


300 


TGGTCACAGT 


AGCCACAACC 


TTACCTGACT 


CATCCAAGAT 


TTGTCCGTAG 


TCAGCTACCT 


360 


GTCCACCCAT 


TTCAGCATAA 


AATGACGTTT 


CCCGCAAAGA 


TAAGAGAGGC 


AGTTCCTTCT 


420 


GAAACAGCTT 


CTACTTCTGC 


ATTGTCCGCC 


ACGAATAGCT 


ACCAATTTAG 


AAGACAATTG 


480 


GCT/ CTATTG 


TAGTTGAAGG 


CACTTTCTAC 


AGTGATGTTT 


TGAAGAGTTT 


CCATTTTGCA 


540 


TACCCATTGA 


GCCACCCTTG 


ACAGCTGACG 


CACGCGCGCG 




TCTTTCATGG 


600 


CTGCTTCAAA 


ACCTTCACGG 


TCTACAGTCA 


TACCAGCTTC 


TTCAGCGATT 


TCTTCAGTCA 


660 


ATTCAACTGG 


GAACCCATAA 


GTATCATAGA 


GTTTGAAGAC 


ATCTGAACCA 


GCGATAACAG 


720 


ATTGACCTTT 


TTCTTTCAAG 


TCTGCTACAA 


TGCCTTGGGC 


AAAGTGTTGA 


CCTTGAGTGA 


780 


AGGGTACGGG 


CAAATGATTC 


TTCTTCGCCT 


CTTAACGATT 


TTCTCAATAA 


AGTCACGTTT 


840 


CTCAAGCACT 


TCTGGGTAGT 


AGCTTTCCAT 


GATTTTTCCA 


ACAGTTGGAA 


CGAGTTTTGT 


900 


AAAGGAAAGG 


CTCGTTGATA 


CCCAATTTTT 


GACCCATGCA 


TAGAAGCACG 


ACGGAGAAGA 


960 


CGACGAAAGG 


ACATAACCAC 


GACCCTCCAT 


TTCCTGGAAG 


GGCACCATCC 


ACCGATGGCA 


1020 


AATGAAAGTG 


AACGGATGTT 


GGTCCGCGAA 


TGAACCTTGA 


AT 




1062 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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CTCAAAGGTC GCACCTGCAA TACGACCCGC TACAGGACCT ACACTTGCTC CATGCTTGGG 
ACTATTGCCT ACATAACTAC CAAAGTCATC AAATCCCAAG ATAACATTGG CAAAATTTCC 
AGCCTTGTCA GGTGCGACAT AGCGCAAGAT AGTCGCACCA TAAGTCATAA CCTCAAGTTG 
GTAGCCACCG TCTGTCTCAA ATCGATAGGC CAAGACATCC TCACCCTCAA CATTTCCAAA 
TACACGCTCT GTGTATGCTT TCATTCTGTT CTCCTTTTAC TATTTCTCTC AAGCAAACAA 
ACCATAGAAA GCGTACTGAC AATCTATGGT TTATCTGATA ATTTACAAAt' CCTCTTGTCA 
AGAATTCATA AACACTGTCT TACTTTTGAT ATTCGTGAAT TATGACACCT TGTACTACAC 
GGTTTACTGT ACCTGTAGGA GACGGTGTAT CTGGTTTATT TTCTACCTTG AGTGAAGTCA 
ATAGGGCAAA GAGTTGGGCA TAAACGATGT AAGGGAAGAC ACGGTAAATA TCATTCAAGA 
CACCGCCACA ACCAAGGGCC ACTTCTTTGA CATTTTCAAG ACCAAAAGCT TGATCACTCA 
AAAGCACAAC ACGACGAGCA ATCTGGTCAC CAGCAACTTC ACGAACCAAG TCCAAGTCGT 
ACTTACGAGT GTAGTCCGTC CTTGTACCAA AGAACAAAAC AACTGTATTG TCGTTGATAA 
GATATTTGGA CCGTGACGGA ACCAACTGGG CTTTCATACA TGGTCGCAAC TTGAACAACA 
GTTAATCCCA AATCTTGAGC TGAGCCTCAT GAGCAGTCCA AAGAAGACCA GCGCCTAGAA 
GAGACCGGTT AAGTCTTAAT CACAG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
865 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

TTTTGCTCCA GTTGATAAGA AAGGCTTGGT CGATGTTGAG GCGTTAGCAG GTTTGATACG 
GCCTGATACA ATCCTCGTTT CCCATCATGG CTGTGAACAA TGAAATCGGC TCTATCCAAC 
CNATTGAGGC TATTTCAGAA TTCTTGGCAG ACAAGCCGAC TATTTCCTTC CACGTTGATG 

cgL7™o ATTCCGACTC GACAGAACGG G^GATTGCG 

CGACCTTCTC GAGTCATAAA TTTCATGGTG TCCGTGGTGT TGGCTTTGTC TATATCAAAT 
CTGGCAAGAA GATTACACCT CTTCTTACAG GTGGTGGCCA GGAGCGAGAT TATCGTTCGA 
«^C«M TCTOCC^ ««■ ^ ^CCO^ 
AGCTAGATAT CTTTAGGAGC AAGACTGGGC AGATGAAGGC AGTGATTCAC CAAG^C^ 
T^ACTAOCC GGATATTTTT GTCTTTTCAG ATGAGGAAGA CTTTGCACCT CATATTCTGA 
CT^TGGAAT CAAAGGTGTT CGAGGTGAAG TCATCGTTCA CGCCTTTGAA GACTATGATA 
TTTTCATCTC AACAACCTCA OCBWTCIT CTAAGGCAGG AAAACCAGCC GGTACCTTGA 
TTGCCATGGG AGTGGACAAA GATAAGGCCA AGTCAGCTGT GCGTCTTAGC CTAGACTTGG 
AAAATGATAT GAGTCAGGTC GAGCAGTTTT TGACCAAGTT AAAATTGATT ^ZIZ 
CTAGAAAAGT AAGATAGGAG CATTCATGCA GTATTCAGAA ATTATGATTC GCT« 
OTTOICMCC AAGGGTTAAA AACCGTATGC GTTTCATCAA TAAACTTCGT 
CGGACGTTTT GTCTATCTAT ACCCAAGTTA AGGTAACAGC AGATCG 

119 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
946 
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(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

CTTGCTCGTA ACAGGTGCTA TCCTTGGTGT CAATGTTCAC ATCTTCTCAC CAAAAGAACT 
CTTCCCAGAA AAAGAAATCG TTGAATTGGC AGAAGGATTT GCTAAAGAAA GTGGCGCACA 
TGTTCTCATC ACTGAAGATG CTGATGAAGC AGTTAAAGAT GCAGACGTTC TTTACACAGA 
CGTTTGGGTA TCAATGGGTG AAGAAGACAA ATTCGCAGAA CGTGTAGCTC TTCTTAAACC 
TTACCAAGTC AATATGGACT TAGTTAAAAA AGCAGGCAAT GAAAACTTGA TCTTCCTACA 
CTGCTTGCCA GCATTCCACG ATACTCACAC TGTTTATGGT AAAGACGTTG CTGAAAAATT 
TGGTGTAGAA GAAATGGAAG TAACAGACGA AGTCTTCCGC AGCAAGTACG CTCGCCACTT 
CGATCAAGCA GAAAACCGTA TGCACACTAT CAAAGCTGTT ATGGCTGCTA CACTTGGTAA 480 
CCTTTATATT CCTAAAGTAT AATTTTAGAT AATAAACCGT CTACCAACAG CTATGAGGGC 540 
TGCGACTAAT AG 



60 
120 
180 
240 
300 
360 
420 



552 



(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CTAAAAAAAG TCAAGTAGAA AACGAATATT CTACTTAACT TGCACGAAAT TATTTTTCAC 

GAATGACTTC GACCTTATAT CCATCAGGGT CTTTGACAAA GTAATAGTTT GGTGCAGTTC 1 

CTGGTAGACC ATTTGGCTCA GTCACTTCAT AGCCTTTTGT ACTGTGCTCT TGATGAAGTG 1 

CCTCAAGATC AGGTGTACTG AGGGCGATAT GGGCAAACCC ATCACCAACC ACATACGGAC 2 

CGTGATCGTA GTTATAAGTC AACTCCAACT CATAGTCATC ACCCTCAAGA CCTAGATAGA 3 

CAATCGTGAA GGCATGGTCT GGAAAATCTC TGCGACGCAA TTCTTTAAAA CCAAAAGCAT 3 

CTTGATAAAA TGCAATTGAT TTTTCAAGAT TTTCTACTCG TAAGCAAGTG TGTAGCATTT 4 

TTGAAGCCAT ATCTTTCTCC TTTATTTTTA AAAAGACTGG ACAATCCTGT TCCAGTCTCA 4 
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TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CT 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 



(2) INFORMATION FOR SEQ ID NO: 105 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 : 

= = = ™ = — 

TTCCTTTTTG <rrmn*n*r> ~ TAGTGG AAAAAGCGGG CAAGAAGGTC 

£S — 4 — A ~ c ™ 

— . — J£ =s =: s= = 



512 



CTMACOC, ™ TO TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 

MOTOTGT 

AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA TCAAGGCAGC jl™*™.,,. 
TCACCGCCAC GATTATCACT ATT1TAGGAG «A^ ££££ 

AGKKCGTC TGGCTGTCAA TGCAGAltJTC CTAGTTCATG AGTCCACTTA 
OATOAAAAA, ^CTCO.AA CCA^GTCAC ^AACTAATA ££££ 

aaSZ° cctcctactc aaccatatca g^cccg™ 

GATATAAGCA AACTCAAGAA GGACGCTCCC ACAA". TITO AAAATCTCCA — „,„ 
A^a A"" <" »™ 2= «. 



60 
120 
180 
240 
300 
360 
420 
480 



551 



60 
120 
180 
240 
300 
360 
420 
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GGTGATGTCC TTATGGTCAT CGTGAAAAAT AGCGGAGATG CCAAACTGAA TGTGGAACGC 
AAAGGCAAGG TAGCCCGTGT TTTCCTCAAA GAAAATGGGG AAACAGTAG 



480 
529 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



CTAGGATAGA 


CCGCTTTTTA 


GCATTTATCT 


AAGCATTCCA 


GAGTACATGT 


ATCTTGCATG 


60 


TGCTCTTTCT 


TTTGGGGTTG 


AAACGATAGG 


AGAAGGAAAT 


GTTAGAATTG 


CTTAAATCAA 


120 


TCGATGCTTT 


TGCTTGGGGA 


CCGCCCCTCT 


TGATTTATTG 


GTCGGAACAG 


GGATTTACCT 


180 


AACCATGCGG 


CTAGGACTCT 


TGCAGGTTTT 


GCGTCTGCCC 


AAGGCCTTTC 


AGCTTATTTT 


240 


TATCCAGGAT 


AAGGGACATG 


GTGATGTATC 


CAGTTTTACA 


GCTTCTTGTG 


TAACAGGCCC 


300 


TTGGGCATTC 


AAACTGGTTG 


GGAAACAGGG 


AAATATCCAT 


TAGGGAGTTG 


GCGACGGCTA 


360 


TCAAGGTTGG 


TGGACCAGGA 


GCTCTATTTT 


GGATGTGGAT 


GGCGGCTTTC 


TTTGGAATGG 


420 


CTACCAAGTA 


TGCGGAAGGA 


CTCTTGGCCA 


TCAAATACCG 


CACCAAGGAC 


GACCATGGTG 


480 


CAGTAGCGGG 


AGGTCCCATG 


CATTATATCC 


TTCTAGGGAT 


GGGAGAAAAG 


TGGCGACCAC 


540 


TTGCTGTTTT 


GTTTGCAGTA 


GCAGGAGTAT 


TGGTTGCTCT 


CTTGGGAATC 


GGAACCTTCA 


600 


CCCAAGTCAA 


CTCGATTACA 


GAATCTATCC 


AAAATACAAC 


GACGATTTCG 


CCAGCCATCA 


660 


CAG 












663 



(2) INFORMATION FOR SEQ ID NO: 107: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTTGACCAC CAATTCTTCA CCCGATATGA TGCGCCCTGC TCGCTGAGCA AGCCCCAAGA 60 

GATTACTTAT CTTTTGCTTA TTCAAGTCCC AACTCTCTTC TTTTCACTTT GTGATCCACA 120 

TAAGCGATCA ACTCGTCATA AAAGCTTTCT TCCACTTCCA TGCTAAAGCT GCGGTTAAAG 180 

ACCTTCTTCT TTTTCGCCTC TAGGGCTTCT GCATTGTCTA GTTTGATATA AGCGCCGCGG 240 

122 
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CCATTGGCCT TGCCCGTAGG ATCAATAAAG ACTTGTCCTT CCTTGTTCTT GACAATGCGG 
AGCAAATCAC GCTTATCAAT CACTTCGTTA GACACAACAG ACTTGCGCAA AGGGATTTTT 
CTTGTTTTCA TCTTTCCCTC CTCTAGCAGC TTTTATTCTT CTACAGTATC GTTTTCTACT 
TCCAACTCTA CTGAAGCAGC GTCTTCCATG GCTTCAAATT CGCTAGCAGA CTTGATATCG 
ATACGGTAAC CAGTCAAGTG AGCCGCCAAG CGCACGTTTT GTCCACGACG ACCAATGGCA 
AGAGAAAGCT TGTTATCTGG AACAACCACC AAGGCACGTT TGCTGTCGTT TTCATCAAAG 
ATAACTTGGT CAACCTCAGC AGGAGCGATG GCATTGTAGA TAAATTCAGC TGGATCTGCT 
ACCCACTCGA TAACATCGAT ATTTTCTTCG ATTGGTACCA TGCGGTCATT TTTAGCATCG 
TAACGAG 

(2) INFORMATION FOR SEQ ID NO: 108 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CTAATGTAAA TCTATGGGAT AA..CGTTAGC TTCTTTTAAT AGATTATTAA GCAATTTTTT 
AAAACAACTC ATCAAACAAA CTCAACTGGT TATCTCTGGC ATATTTCCAA GAATACCCAT 
CTCATCCATC TTTTCAACCA AGGTTGATGA GAGTCCACCA CGCTTGCGTA GTTCTGTTTT 
AGAGAGGAAT TCTCCCTCTT CACGCGCCCG CACCAGTTGC TTGGCAACGT TCTCCCCCAG 
ACCATCCATT GCTACAAATG GTGGGATAAG GGTATCCCCG TCGATGAGGA ACTCTGTCGT 
CTGACTACAG TAGAGATCTA ATTTACCAAA CTTGAAACCT CGTTCCCACA TCTCATTGAC 
AATCTCAAGA GTTGTATAGA GATCGATTTC CACATTAGAG GCTTCATTGT TCTTCCGTTT 
TTCAGAGATT TCTTCCATTC TGCGCTTGAT GACCTCCAAG CCCGCACCCA TGGTCTTGAT 
ATCAAAAGCC TTAGCACGAA TGGAGAAGTA AGCACAGTAG TAATAAATAG GATGGTGAAC 
CTTGAAGTAA GCTACACGCA AGG 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



300 

360 

420 

480 

540 

600 

660 

720 

727 
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(ii) MOLECULE TYPE: Genomic DNA 



123 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
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CTAACTTCAA 


AAGCTTTTAG 


AATATCTTCT 


TGACTAATCA 


TATGATTACC 


GCCCAAGGTT 


60 


AAGTTTTCCA 


AAATAGAGCC 


ATTAAAGATA 


TAGGCTTGTT 


GGGGTAGGTA ATTAATATGA 


120 


CGGCGCAAGA 


CTTTTTTTAT 


CAATGTTTTT 


AATATCCTGA 


TGATTGATGG 


AAATATGCCC 


180 


TTTGTAGGGT 


TCAAAGAAAT 


TGACAATCAT 


TTTGGCTAAA 


GTTGTTTTAC 


CAGAACCACT 


240 


AACTCCAACT 


AGGCTAACCT 


TATCTCCTTG 


TTTAATCGTG 


AGATTAATAT 


CTGTTAAGGT 


300 


ATCTCGTCCA 


AAACCATACT 


TATAAGAAAG 


GTCATCAAAT 


TCAATATCGC 


CCATCAAAAA 


360 


ATGTGAATGA 


ACAGGGTTTT 


CTTGAACTTG 


AAATTCAGAT 


TCGACTAGAT 


AGACTTCGTT 


420 


CAAACGGTTA 


TTAGCGACCT 


TCGCAGATTG 


GAGTTTGGTT 


TGGAGGTTGA 


TAATATTTTC 


480 


CATAGGAGTT 


GTAAAGTAAG 


AAAGAAGTGT 


GTTAAAGGTA 


ATCAGCTGAC 


CGATAGAAAT 


540 


TTTACTCGAC 


ATGACTAATT 


GAGCGCCAAA 


CCATAGGATA 


AGGATATTCA 


GAACTAATTT 


600 


TGTTTCCCCT 


GCTTTAAACT 


CGTTTGTAAA 


ATAGAATATT 


TACTGAG 




647 



(2) INFORMATION FOR SEQ ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 610 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



CTATCCTNGA 


ACCAATGATG 


CTTGTAACAA 


TCACTGTTCC 


AGAAGAAAAC 


CTTGGTGATG 


60 


TTATGGGTCA 


CGTAACTGCT 


CGTCGTGGAC 


GTGTAGATGG 


TATGGAAGCA 


CACGGTAACA 


120 


GCCAAATCGT 


TCGTGCTTAC 


GTTCCACTTG 


CTGAAATGTT 


CGGTTACGCA 


ACAGTTCTTC 


180 


GTTCTGCATC 


TCAAGGACGT 


GGTACATTCA 


TGATGGTATT 


TGACCACTAC 


GAAGATGTAC 


240 


CTAAGTCAGT 


ACAAGAAGAA 


ATTATTAAGA 


AAAATAAAGG 


TGAAGACTAA 


TCCGTCCTCA 


300 


CTCTAGAAGG 


AAGTCACTTA 


GTGGCTTCCT 


TTTGTCTTTA 


GAAAATACCT 


CTAAATATGG 


360 


TAAAATAGTA 


GAAGAATAAT 


GTGAGGAAAA 


TGAATGTCAA 


ATAGTTTTGA 


AATTTTGATG 


420 


AATCAATTGG 


GGATGCCTGC 


TGAAATGAGA 


CAGGCTCCTG 


CTTTAGCACA 


GGCTAATATT 


480 


GAGCGAGTTG 


TGGTTCATAA 


AATTAGTAAG 


GTATGGGAGT 


TTCATTTCCG 


TATTTTCTAA 


540 


TATTTACCTA 


TTGAAATCCT 


TTTTACAATT 


AAAGAAAGTT 


TGAGCGAAGA 


ATTTCCTNAG 


600 


AANGCCATCA 












610 



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 

124 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

TATAAGCATA GTACCAGTTG CCATTGACCT GGAACCAACT GTCTTCATAT CTCCATTACT 
GCATTTAGGT AGTACCAAGT TGAACCATCT TGATACCAAC CAGTTGCCAT TGCTCCTGAT 
GAACGGAGAT AGTACCATTT GTTCCCAAGG TTTTGCCAAC CTGTTTTCAT ATCGCCATTT 
GGCTGGTCTA AATAATACCA AGTGGTACCT TCCTGATACC AACCAGTTGC CATTGCTCCT 
GATGAACGGA GATAGTACCA TTTGTTCCCA AGGTTTTGCC AACCTGTTTT CATATCGCCA 
TTTGGCTGGT CTAAATAATA CCAAGTGGTA CCTTCCTGAT ACCAGCCAGT GGCCATTGCT 
CCTGAGGAAC GGAGGTAGTA CCACTTATTA CCTAGATATT GCCAACCTGT TTGCATAATA 
CCAGTTGTTG GATCTAAATA GTACCAAGTC AAATCATCGT TTATCCACCC CGCACGT^ 
TCACCACCAA GGTAGTTTTC TCCATTAATT TCCGTCTTAG CTAGATAATA CCAGTAGACT 
GATCATAAAG CCAACCTGTC TCTAAAGAAT GATTTTGATT AAAGTAATAG TTCGTATAAT 
AACGNTTCTC TTCTTTATCT TCTGAATCTT CACGTTTTTC CCCGTACTTT CTTC^C 
TGTCTTTAGT TTTAATCTCT AATGTTTTCC AACCAACAAA CTCTTGTAGC ACT^T 
TATCGAAGTA GTACCACTCT GAATTTGGAA AACCTTCTAA TCTCATACCA TTTGGGTAAG 
GACCAATTGT AC 'ACCTTTA GATGGAAACG GGATATATTG CCAGCCGACA ACCATCTCTC 
CAGATAGAGA A.CAAAATAA TAGTACTTAC CATCAATCAC TCGCCAGTAG GTTTCTTTGA 

:™ ~ ~ - 

984 

(2) INFORMATION FOR SEQ ID NO; 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 base pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



Stc™ CAGATCAACG " ATTTCTTC TC ^A GTC ACAAGAATTT TGTTGATCGT 
AATCTTTTTA TTACCATTCG TGACAAGGAA ACCACC^TA TCAAGCCTTA TcS^ 
TTGGATTTGC CACATGGTCT GGCCTTGGAT GTTTTGCCTT TGGATTATTA TCCGAAAAAT 
oa^MAC cs™^ ecmakm 

" ,m " »"*•"» ««««»» ore™ 



60 
120 
180 
240 
300 
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ACTCCAAAAT CTCTCCGTTA TCGCATCTGG AAAAAAGCTG AGAAAGAAAT GACTAAGTAT 
GATTTGGCTG ATTGTGATGG CATTACAGAA TTATGCTCAG GTCCTGGCTA CATGAGAAAC 
AAGTACCCAA TCACATCTTT TGAAGACAAT CTTTTCTTGC CATTTGAAGG AACAGAGATG 
CCTATTCCAA TCGGCTATGA TGTCTATCTC AGAACTGCTT TTGGGGATTA TATGACGCCT 
CCACCAGCAG ACAAGCAGGT ACCGCATCAG GATGCTGTCA TCGCTGATAT GGATAAGTCT 
TATACAGAAT ACAAGGGAGA ATATGGTGGC TAAGAAAAAA ATCTTATTTT TTATGTGGTC 
TTTTTCTCTT GGAGGTGGTG CAGAGAAGAT TCTATCAACC ATTGTTTCAA ATCTGGATCC 
AGAAAAGTAT GATATTGATA TTNTTGAAAT GGAGCACTTT GACAAGGGAT ATGAATCTGT 
TCCAAAGCAT GTACGCATTT TAAAATCCCT TCAAGATTAT CGCCAAACCA GATGGATACG 
AG 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



CTGGCAAATA 


CAAGGTGACG 


ATCATTGGTA 


AATCAGCCCA 


CGGTGCTATG 


CCTGCTTCAG 


60 


GTGTCAATGG 


TGCGACTTAC 


CTAGCCCTCT 


TCCTTAGCCA 


GTTTGACTTT 


GCTGGTCCAG 


120 


CCAAAGAATA 


CCTTGACATC 


ACTGGTAAAA 


TTCTCTTGAA 


CGACCATGAG 


GGTGAAAGTC 


180 


TCAAGATTGC 


TCATGTGGAT 


GAAAAGATGG 


GTGCCCTTTC 


TATGAATGCA 


GGCGTCTTCC 


240 


GCTTCGATGA 


AACAAGTGCT 


GATAATACCA 


TTGCCCTCAA 


CATCCGCTAT 


CCAAAAGGAA 


300 


CAAGTCCAGA 


ACAAATCAAG 


TCAATCCTTG 


AAAACTTGCC 


AGTTGTTTCT 


GTTAGCCTGT 


360 


CTGAACACGG 


TCACACGCCT 


CACTATGTTG 


CCAATGGAAG 


AATCCACTTG 


GTTGCAAACC 


420 


TGGTTGAAAT 


GTCTATGAAA 


AACAGACAGG 


CCTTAAAGGT 


CATGAACAAG 


TCATCGGTGG 


480 


TGGAACCTTT 


GGTCGCTTGT 


TAGAGCGCGG 


AGTTGCCTAT 


GGTGCTATGT 


TCCCAGACTC 


540 


AATTGATACC 


ATGCACCAAG 


CCAATGAATT 


TATTGCCTTG 


GATGATCTCT 


TCCGAGCAGC 


600 


AGCAATTTAT 


GCCGAAGCTA 


TTTACGAATT 


GATCAAATAA 


AACGATAGAA 


GTCTGAGATC 


660 


TTATGCTTGG 


ACTTCTTTTT 


GGAGGGAAAG 


TAGATGTCTC 


AAATCGAAAG 


AATCAAACAG 


720 


GCTATCATGG 


CGGATTCACA 


GAATGCCAG 








749 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

126 



360 
420 
480 
540 
600 
660 
720 
780 
840 
842 
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fii) MOLECULE TYPE: Genomic DMA 

(Xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 114: 

CTGGATGAAT TGCAGAAGCG CAACCTACTG GGATTTGTCT TCCAAGATTT TCAACTATTT 
CCTCATCTAT CAGTTCTGGA AAATTTGACT TTATCGCCTG TGAAGACCAT GGGAATGAAG 
CAGGAAGAGG CTGAGAAGAA GGCGAGTGGA CTCTTGGAAC AGTTAGGACT A^G^C 
CCAOAGTCCT ATCCtncC ACTATCTGGT GGGCAAAAGC AGCGGG TC g" T^CGct 
GCTATGATGA TTGACCCAGA AATCATTGGC TACGATGAAC CAACTTCTGC 

gaattacgtt ^caagtgga gaagctaatc ^aaaata SSSS 
AT^ A cccatgattt gcag^c tcaaaa TATC gcaca^ SSSS 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 : 



= = = = = = 

====== 

= = ~ = = = 

====== 

= _ ^ = = = 



60 
120 
180 
240 
300 
360 
420 



503 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



CTGCAATGGT 


TAGCGAGGAT 


GGTGAAATAG 


AGGAGCCTTA 


ACTATTTTGA 


GGAATCCAAG 


60 


CTGACAGGAC 


TGGTATGTCT 


GCTCTTTGGC 


TCTGAAGGCA 


GACTATACCA 


GCTGGAAGTT 


120 


TGACAGATGT 


GGCACGTGAG 


AATTATTGGA 


GGCNTTGACN 


TANTCAGAGG 


ATTCTCTCTT 


180 


GATATTGGGT 


TTGGCAGAGT 


TGGATAGTGA 


GTTGGAAAAT 


TACCAAGCGG 


TTATTCAAGC 


240 


CTATGCCCAG 


HAuA 1 AA i\. 


X UVjAl X In 








300 


AATTGGCTTT 


GCCTATGCTC 


AGTTAGGGAA 


ATTTGAAACG 


GCTACTGAGT 


TTTTAGAAAA 


360 


AGCCCTGGAG 


TTAGAATACG 


ATGACTTAAC 


AOW 111 lUAU 


X 1\jOCL. Ao 1 


X X 1A1 I X X\xn 




TCAAGAAGAA 


TATCAAAAAG 


CCACCCTCTA 


CTTTAAGCAG 


CTTGATACCA 


TTTCTCCTGA 


480 


CTTTGAAGGC 


TATGAGTATG 


GGTACAGTCA 


GGCTTTACAT 


AAGGAACATC 


AAGTTCAAGA 


540 


AGCCCTGCGT 


ATCGCTAAGC 


AAGGATTAGA 


GAAAAATCCC 


TTTGAAACTC 


GCCTCTTGCT 


600 


AGCTGCTTCA 


CAATTTTCTT 


ATGAATTGCA 


TGATGCTAGT 


GGTGCAGAAA 


ATTATCTCCT 


660 


TACTGCAAAA 


GAAGACGCTG 


AGGATACAGA 


AGAAATCTTG 


CTTCGTTTAG 


CCACTATTTA 


720 


TCTGGAGCAG 


GAGCGTTATG 


AGGATATTCT 


AGACTTGCAG 


Au 1 uAl?uAi7l~ 


LnunvuinlL l 


780 


TTTGACCAAG 


TGGATGATTG 


CTCGTTCTTA 


TCAAGAAATG 


GACGATTTGG 


ATACTGCTTA 


840 


TGAGCATTAT 


CAAGAGTTGA 


CAGGAGATTT 


GAAGGACAAT 


CLA< . l AIl 1\- 


X uunnLAL I A 


900 


TATCTATCTC 


TTGCGTTGAA 


TTGGGACATT 


TTGAAGAAGC 


AAAATCCCAT 


GCTC 


954 


(2) INFORMATION FOR SEQ 


ID NO: 117: 








(i) SEQUENCE CHARACTERISTICS: 








(A) 


i LENGTH: 564 base pairs 








(B) 


\ TYPE: nucleic acid 










(C) 


1 STRANDEDNESS : single 










(D) 


1 TOPOLOGY: 


linear 










(ii) MOLECULE TYPE: Genomic 


DNA 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 117: 






CTATGAAACA 


TTTTGATACT 


ATTGTCATCG 


GTGGGGGACC 


TGCTGGTATG 


ATGGCTACGA 


60 


TTTCCAGTAG 


CTTTTATGGA 


CAGAAAACCC 


TCCTCATCGA 


AAAAAATCGG 


AAACTTGGAA 


120 


AAAAATTAGC 


TGGGACTGGT 


GGGGGACGTT 


GCAATGTGAC 


CAACAATGGT 


AGCTTAGACA 


180 


ACCTGCTAGC 


TGGAATTCCT 


GGAAACGGAC 


GCTTTCTTTA 


CAGTGTTTTC 


TCCCAGTTCG 


240 


ATAATCATGA 


CATCATCAAC 


TTTTTTACAG 


AAAATGGTGT 


TAAACTTAAG 


GTCGAAGACC 


300 


ACGGACGCGT 


CTTTCCAGCC 


AGTGACAAGT 


CTCGGACTAT 


TATCGAAGCT 


TTGGAAAAGA 


360 


AAATCACTGA 


ACTAGGTGGT 


CAAGTTGCTA 


CTCCAAATAG 


AAATCGTTTC 


TGTTAAAAAA 


420 


GTAGATGACC 


AGTTTGTCCT 


TAAGTCAGCG 


GATCAAACCT 


TCACTTGTGA 


GAAACTCATT 


480 



128 



(2) INFORMATION FOR SEQ ID NO: 119 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 



CTTAGTATCT AGAAAAGGAG AAATAAAATG GTTAAAGTAT TAGCAGCGTG CGGAAATGGA 
ATGGGTTCAT CAATGGTTAT CAAGATGAAG GTTGAAAATG CTCTCCGTAA GCTTAATCAA 

acaga™a cagtcaa™ akcagtgtc gg TC aagcta aagg^agc agtaggatat 

GACATCGTAA TCGCTTCTCT ^A^A^ CAAGAATTGG AAGGGCGAAC TAA TC ^G 
TTAATTGGGC ttgataactt gatggat^at AAAGAAATCA CCGAAAAACT cagtcaagca 



540 
564 
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gtcacaacag gtggtaagtc ttatccttcg actggttcga ctggttttgg tcacaatgct 
ccgccattta agcataccat cacg 

(2) information for seq id no: 118: 
(i) sequence characteristics: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 118: 

CTCCGTGGTC TTTTGGTTTG CTACGGGATG TAGGCAAAAG GCAAGTCCAT TTTATCAGCA 
ATAATGGCTC CGTGTGGAAT CCCTGCTGTT GCAGTTCCTG CAATCACTTC TACTTCAGGA 
AAGGCTTCTT TGATAGCTTC CACAAAACCA TTTTCAATTA GGGTACGAGT TTCTGGATAG 1GU 
GCTAGTGTCA CACGATTATC AGTGTAAATC GGTGACTTGA TACCAGATGC CCAAGTGAAG 240 
GGTTCCTCTG GTTTGAGGTA AACGGCTTGG ATTTTCAAGA GGTGGCTAGC GATATCTTTA 300 
GCAAGTGTCA TGGTATTCTC CTTTTATTTT TCTAATCTAG TTCTTTAATT CCAGTCCTGT 
GTCCATTCAT CCTTGATGGC ATGATAAGCT GCAACAGGAT CCTCAGCTTG GGTAATGGGA 
CGTCCCACTA CGATATAGTC ACTGCCGATT TGATAGGCAT CAGCAGGTGT CATGACTCGT 
TTTTGATCTC CAACTGCAAC ACCAGCTGGA CCAATCCCTG GTGTCCCGAC AGATAAAAAT 
CTGGATTGGT AGCCCTGCTT GATGACTTTG TCACTTCCCT GAGCCGAGCC AAACCAACAC 
CCATCCCAAA GCCCAAGCTT CAGCTGTCTT CCTTGGCATA GTTGAATCCA C 



60 
120 
180 



360 
420 
480 
540 
600 
651 



60 
120 
180 
240 
300 



129 



ATACAGTAAA AGGTTGGAGG GGGCTGGACA CAAACTGAGA GTTATCGTTT CTGTCCTTCT 
CCCTCTTTAA ATAAAGGAGG CAGATATGAA TTTAAAACAA GCTTTAATTG ACAACGACTC 
GATCCGACTA GGTTTAGAAG CTAACGAATG GAAAGAAGCA GTCAAGGTAG CAGTAGATCC 
CTTGATTGAA AGTGGGGCAA TTTTGCCAGA GTATTACGAT GCTATCATTG AATCGACTGA 
AGAGTATGGG CCTTACTATA TCTTGATGCC AGGTATGGCT ATGCCCCACG CTAGACCTGA 
AG 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i> SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 586 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ATTGGGAATT TTTGCAACTG CAATTGGTGC CCTCAGTAAT CTATAAAATA GATTCAAGAA 
AATTTAGTGA CTGGGATTTC CCAGCCCTTT TTTAAAGTGA GAAGAAATAA TGAGTATGTT 
TTTAGATACA GCTAAGATTA AGGTCAAGGC TGC AATGGT GGCGATGGTA TGGTTGCCTT 
TCGTCGTGAA AAATATGTCC CTAATGGAGG CCuTTGGGGT GGTGATGGTG GTCGTGGAGG 
CAATGTGGTC TTCGTTGTAG ACGAAGGACT ACGTACCTTG ATGGATTTCC GCTACAATCG 
TCATTTCAAG GCTGATTCTG GTGAAAAAGG GATGACCAAA GGGATGCATG GTCGTGGTGC 
TGAGGACCTT AGAGTTCGAG TACCACAAGG TACGACTGTT CGTGATGCGG AGACTGGCAA 
GGTTTTAACA GATTTGATTG AACATGGGCA AGAATTTATC GTTGCCCACG GTGGTCCGTG 
GTGGACGTGG AAATATTCGT TTCGCGACAC CAAAAAATCC TGCACCGGAA ATCTCTGAAA 
ATGGAGAACC AGGTTCAGGA ACGTGAGTTT ACAATTGGAA CTTAAA 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 

TTCGACAATC TTGCCCNNGA TCCAAGACAA CCACTTCTCT GTCCGCTCAG CAATAGTCAA 
GCGGTGAGCA ATGAAAATCA AAGGTCTTGT CCAAAGCCAT GAGATAATGG ACAATCCGTT 
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TTTTTGTCAA AATATCCAAA CTGTTAGTCG CCTCATCCAA AATCAAGACC GGCGCATCTG 180 

TCAAGAGAGC ACGCGCCAAA GCGATTCTCT GACGTTGACC ACCTGAAATC CCTGCCCCAT 240 

CCGAAGTCAA TTCTGTCTGG TAAATTTCAA GGTGGCATGG CGGCTTCGAG ATATTCCTCT 300 

TCGGAATCTC TGAACCAAAT TCGGACCGGC CCGTTAAAGA TATCTTCCCT GTGTCGTCCC 360 

CTCCCTTGGC TCCCAAAAGA AGATTCTCCA AAATCGTTCC GTTAAAGACA TAGGGCTGTG 420 

GAGGCAGATA GATGATGTAC TGGCGTAGGG CTTTTTTATC AATCTGATTG AGATTGACAC 480 

CACCCAGACT AATCTCCCCT TGACTTGGGT CGTAAAAATT AACCATCATC TTGGCCAAAG 5 40 

TCGTCTTACC TGACCCTGAA ATCCCCACAA AAGCCACCTT AGACCCTTGG GGAACGGTTA 600 

AATTGATATC CGACAAGACG TCTCGACCAT AGCCATACTT GTAATGAACC TGCTTGAAAG 660 
TCATCTCTCC CTTCATCAAG 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

CCAATTTGGA TTTGCGAGAA GGCGTTATTA TTTCAGATGC TGGTTCGACC AAGTCAACCA 
TTGTGGATGC GGCGGAGCAG TATTTGGCTG GCAAGTCTGT TCGCTTTGTC GGGGCCCATC 
CCATGGCTGG TAGTCACAAG ACAGGGGCTG CTTCGGCAGA TGTCAATCTT TTTGAAAATG 
CCTATTATAT CTTTACAACC TTCCAAAGCC CTGAACAAGT CAAGGACAAC GCTTAAAGGA 
AATGGGAAAG GATCTGCTTT CCAGGTCTTC ATGGCTCGTT TTATCGAGAT TGATGCCAAG 
GAGCATGATC GTGTCACTTC TCAGATTAGC CATTTTCCTC ATATTTTGGC TTCTAGTCTC 
ATGGAGCAGA CTGCGGTCTA TGCTCAAGAG CATGAGAATG GCAAGGCGCT TTGCGGCAGG 
TGGTTTTCGA GATATGACCC GAATTGCGGA AAGCGAGCCA GGAATGTGGA CCTCCATTCT 
CTTGTCCAAT AGCGAGACCA TTCTGGATAG AATTGAGGAT TTCAAGGAAC GTTTGGAAGC 
GATTGGTCAG GCCATTAGTA AGGGAGATGA AGAGCAAATT TGGAACTTTT TTAACCAAGC 
G 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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680 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
601 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



CTGGCTCATC 

w X www X v 


ziwwWw/\wn 


TATTTATY2AT 
ini x XAXunx 


1 X X 1 w 


LiAHaAl 1 AA 


GGAAGATAAT 


60 


V_ X wwnO XXXI 


\*n X X X w X Vjvj 


xxx uiwin x w X 




Al\. I I UACCA 


CAATGCCGAG 


120 


CAAAAGGGAG 


AAAGTCTGTC 


TACAGAGGAT 


TTGTGTGCTT 


CCTTCCAAGC 


AGCTGTACTG 


180 


GATATTCTCA 


TGGCAAAAAC 


CAAGAAGGCT 


TTGGAGAAAT 


ATCCTGTTAA 


AACCCTGGTT 


240 


GTGGCAGGTG 


GTGTGGCAGC 


CAATAAAGGT 


CTCAGAGAAC 


GCCTAGCAGC 


CGAGGTTACA 


300 


GATGTCAAGG 


TCATCATTCC 


ACCTCTGCGC 


CTCTGCGGAG 


ACAATGCAGG 


TATGATTGCT 


360 


TATGCCAGTG 


TCAGCGAGTG 


GAACAAAGAA 


AACTTTGCAA 


ACTTGGACCT 


CAATGCCAAA 


420 


CCAAGCCTCG 


CTTTTGATAC 


CATGGAATAA 


AGAGTGGCTC 


TTTGTCAAGT 


GTAGTGGGTA 


480 


GGCGAAAAGC 


TACAATCTGG 


AGANTACGAA 


ATTC 






514 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYr ; Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 



CTTGGATTAT 


AAAATACAAT 


CGGCAAGATC 


ATAAGTCCCA 


AGCCTAAAAA 


TATATAGAAA 


60 


GGGGGTCACC 


TTCCAAAGAA 


ATTCTGTATT 


AAAGAGCATG 


ACCACAAAAC 


CAATCACAAG 


120 


CCCCAAGGCA 


ATCCAGGCGA 


CCTGCTGCCC 


TAAAATGGGC 


AGAATATTAT 


TGGGGTAATC 


180 


ATGACTAACG 


GCTATATAGA 


TAGCCACCAC 


ACCGATGACC 


AGTAGAAAAA 


ATACTGGCAA 


240 


GAGCAAACTG 


TAATCGACTC 


TTGAGTCGAG 


AGAACGTTTC 


ATATAAACTA 


ACCTTATACT 


300 


TTCATACAAT 


ACTATTTATC 


AAAGTTCATT 


AAAAAATCTA 


TCAATAGCCT 


CGTCAACTTC 


360 


GGATCGAGAG 


ATGGTTTTAA 


CAGTCGCTTC 


TTCTGCTAGA 


GATGCTACTA 


TTTGTTTGCC 


420 


GTATCGTTTT 


CCGACGATTC 


TCCTATCCAA 


AATAAGAGTT 


AAGGAACGTT 


GGTATTCACG 


480 


TCTCATACTT 


CTTCCCAAAG 


CCTGTTTTAA 


ACGAATAATG 


G 




521 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

132 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

CTTGCCTGAT GACATCATTG ACTCTTTTTG GTATATCATC GACCATTTCT TAAAAAATGT 
CTTTGAATTG GAAGAAGAAC TCGAGTTTCA ATTGCTTAAT AACCAAGGAA AGATTACCTT 
CCACTTTTCA AGTCAACACC TCCCTACAGC CATTGATTTT GACTTTAACC ATCCTTTCGA 
CCCTCGTTAT CCCCCAAGAG TACTGGTTTT AGACATGGAC GGTAGAGAAA CTATCCTCCT 
CCCAGAAGAA AATGACCTAT TTTAAAAACT CTAGCCTTCA GTTGCAAGTG ACTGAAAACT 
AGAGTTTTTC TATTTTTTCA AAGCATCATA CAAGTTGCGG ATCGGTTGTT TTAATATCGG 
ATGGATAAAA TGAGGCGCAA TTTCCTGTAA GGACTCAAGG ACAAAAAGGC GTTCCCGCTA 
TGTTAAGGGA TGAGGCAATA TGAGGTCGTC TGTATAAAGG ATCTGGTCCT CCCACAAAGA 
GCAAGTCCCA AATCAATCCA AACGAAGTCC CCCAATGCAC TTCTCTCACC CGTCCCCAGC 
TCTTGACTCC AATGGCTAAC AAGGTTTCNT AACAATTCTT GTTGCTGGTA GCCAAGTTC 

(2) INFORMATION FOR SEQ ID NO: 12 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
599 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 6: 

CTATGGGTAA AAAGGATGCC TCTGCTATGC ATGAGATGAG GGCTTCCTTT ATTCAAGGCT 
CCATAGAAGC AGGCCATACT GCGGAAAAAT CAGAGCAGGT TTTTGATGTT ATGGAGAAGT 
TTGCAGGTTA TGGATTTAAT AGATCTCATG CCTACGCCTA CTCAGCCTTG GCCTTCCAGT 
TGGCTTATTT CAAAACGCAT TATCCAGCCA TTTTTTATCA GGTCATGTTA AATTATTCCA 
ACAGTGATTA CTTAATAGAT GCACTTGAAG CAGGTTTTGA AGTAGCCTCT CTATCCATCA 
ATACTATTCC CTATCACGAT AAAATTGCCA ACAAGTCTAT CTATATAGGT TTGAAATCGA 
TTAAGGGGCT CCAGCAAGGA CTTGGCGCTT TGGATTATTG AACATAGACC TTATTCTAAC 

atoacctcag 

564 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



ACTAATCACT 


TACTACGTAG 


CTTCTGACTA 


TCTACCTCTA 


CTAGATAAAG 


uALAAAL l\3 1 


60 


AAGATTAAAA 


CTGGAGAAGN 


TTGGAAATCA 


CGGCATTACC 


ACCATCGGCC 


AACTTCAGAC 


120 


AATTGATCAA 


ACTCCTACCA 


GAACAGAGCA 


AGGCAATCTC 


TTTAAATTAA 


CCGCTCTTGC 


180 


AAAACTATCT 


AATGAGGATA 


GTAAACTCAT 


CCAATATGGC 


TTACAAGGTC 


GCGTCACTAG 


240 


TGTAACTGCA 


AAGAAAACAT 


ATTTTGATTA 


TTTCAAAGAT 


AAAATTTTAA 


CCCATTCTGA 


300 


TTAATTTTCA 


GATAACACTC 


TATAACTATT 


TATTATCTTA 


TCAAAAAGGA 


GAATCATAAC 


360 


ATGGATAAGA 


AACAAAACCT 


AACTTCATTT 


CAAGAACTAA 


CAACTACCGA 


ACTCAACCAA 


420 


ATTACAGGTG 


GAGAATGGTG 


GGAAGAACTC 


TTACATGAAA 


CAATTTTAAG 


TAAATTTAAA 


480 


ATCACAAAAG 


CACTTGAACT 


ACCTATTCAG 


CTATAAAAAC 


AAGACCGAGA 


AACAAGAACT 


540 


CTCGGTCTTG 


TTTTTTATCA 


TTCTGCATGT 


ATCACAGTAA GTACCTGACG 


AAAGACTTGA 


600 


TTTTGGCGAG 


GTAGTATT 










618 



(2) INFORMATION FOR SEQ ID NO: 128: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 549 base pairs 
(V TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



CAGGGGATGC 


CAGTTGATTT 


GCTTCAAGTT 


GACTTGACTC 


GTACTTGGGA 


AATCCTCGGA 


60 


GAAATCACTG 


GGGATGCTGC 


TCCAGATGAA 


CTCATCACCC 


AACTCTTTAG 


CCAATTCTGT 


120 


TTAGGAAAAT 


AAGAAAAATC 


CATGATCCTT 


CATTCGGTCA 


TGGATTTTAT 


TGTCTTTATT 


180 


AGTAATCTGG 


TCTTAAGACC 


CCTGTTACAG 


TTGCCTTAGT 


TGCTTCGTAG 


TCGCCATCTA 


240 


CGACAACCTT 


GATAATGCGT 


TTGACATCTT 


CTTCTGGTGC 


TGGAACAAGA 


GGTAGACGAG 


300 


TGGGTCCAGC 


TTCAAATCCC 


ATATAGTTAA 


GAATTGCCTT 


AACTGGAGCA 


GGACTTGGAT 


360 


AAGAGAAGAG 


AGCATTAACC 


TTAGGAATGA 


ATTTACGCTG 


AATTGCTGCG 


GCTTTCTTCA 


420 


TATCGCTTTC 


TGCAATGGCA 


GTAAACATCT 


CGTGCATTTC 


ATCCCCATTT 


GTATGAGAAG 


480 


CAACAGAAAT 


AACCCATCCG 


CCCCAAGGTT 


CATGGCATGG 


AAAGCATCTC 


CATCCTCACC 


540 


TGTATAAAT 












549 



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 455 base pairs 

134 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CTGGCTANCA AAATCATACT CATAATCCTT AACACCGACA GAGATAGAAG CAATGAGCCC 
^GATC GTCC ATTCGTTTAA TAAAAGGAAT GCGTCCTGCC TCATCAAAAC 

a0CMn °™ CTC — TTCATCCAAA ATCGTCTGCA 

C^T^S GC™^ mAAAGG TAAA ^ CTTGTATCCG 

!^ ^TTTTAATG ACACATTTAT TTGGAATCAA TTGAATATCT TCGTAATCAA 
AAATTGGAAA TTCATTTAAC ATATCGATGT CTCGTTTCTT TTGTAATGAC CTACCT^ 
TCTCGCATCA CTACGCCT^ TCCGACGTTT CCCTTAAATT TATTATAAAC ^ 
GTTTTTGTCA AATTATTTCA TGAATNAAAT ATATC ^AGTACA 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
455 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

CTCCTGGGIT TTGATTAACT CTAGGGTTGC CAAAAAGAGG GTGATGACTT CTTGGACATT 
CAC^3 TTGAACAAAT CCTCCAAGCG ««««T CGTCCAATCA AGGATTCTTT 
r ATGTCCT CTCATCCCGC AAGATAGTCG TGTGaT^ 

™" Sc™ """"V* ATTTGAAAAA GCCAAAAAGA GGTCAATCGT 
CGTCTTGTCA TGCACAAGCT CCGCATCTTC GTAAATCAAC TCTATCGGCG CTTTGGAATA 

St™ """" CCAAGTOCTC AC ~ ^ - 

GATATTCTTC GATTTGAGAG AGGAGGTCCT GCTCCAGGTC ATCCCCCAAG TCTrrrznZ 

GTTTTCTATT Trrrrwin* A rTTCGT AC A AATCGACCTT 

ATT TOTCOTAGA ATAAACTGTT CACGTAAGAC TTGAGATCGA ATTCCT 

(2) INFORMATION FOR SEQ ID NO: 131 : 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
776 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



ATTGATACCN 


ATTGTACAGG 


ACAACTTATT 


TTGGAGCGCC 


ATAGCAGGGA 


TANGGTTAGC 


60 


AATAGATGCG 


NTGACCATAG 


AAACGTTTNT 


TAGCCAATAC 


ATAACCTGCT 


AGAGATGAGG 


120 


TTGCACAAAC 


TAAGAACATG 


GTTACCAATG 


AGATAAATAC 


TGAGTTCCAC 


ATCCATTGCA 


180 


AGGCAGGGTT 


CTGCACCATG 


AGTTGTTGGA 


AGTTTTCCAT 


GGTTGGCATT 


TTAGGGAACC 


240 


ACTGAGGAGG 


AATAACAATT 


GTATCAGGTT 


GTGATTTGAA 


TGCCCCTGTC 


AAAATCCAGT 


300 


AGAATGGAAA 


GATGNACAGC 


ACAGTCAACA 


AGAGCAAAAT 


GATTGTTGAA 


ATNACAGTAA 


360 


AGGCTGTTAA 


TGGTTTTTTT 


TCTGTAGATT 


GCATAGCTGT 


CTCCTTTCTT 


TAGTATTCTA 


420 


CGTCGTTTCC 


AAGTACTTTA 


AATTGAACAA 


AGCTTACGAT 


AGCAATCATG 


ACTGCCAAGA 


480 


AGACACCAAT 


TGTGTTGGCA 


TAGCCGTATT 


CTGTCAATTG 


GAAGGCTTTT 


TCGTAAAGGT 


540 


AGTACATCAA 


GGTACTTGTT 


GAGTAGTTTG 


GACCACCAGA 


TGTCAAAAGC 


TGAATCAAGG 


600 


CGAAACACTG 


GAATGAGTTA 


ATTGTTGTGA 


TGATTGCAAT 


ATAAAGAGTT 


GTTGGAAGAA 


660 


GGCTTGGCCA 


TTTAATCTTC 


CAAAAAACTT 


GAAACTCAGT 


TGCACCATCA 


ACACG 


715 



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



CTGGGCTGAG 


TTTTCCAGAT 


ACAAATACTG 


CAACCGCAAC 


TGCAATCCCC 


CAACCCATAG 


60 


TAATCACAAT 


CCAACCTGCA 


CTGTTGCTCT 


TGGTTTTTGG 


AAGAACCACA 


CCTGCAACAA 


120 


CACCATTTCC 


TAGAAGAATC 


AGGATTAAAG 


TCCCTAAAAA 


TTCTCCAAAT 


AATTCACTCA 


180 


TCATTTTTCT 


GTCTCCATTA 


AAAAGAAGGG 


GCGGGCGACA 


AGGATTGCTA 


CCCTCCACCT 


240 


CTTTTATTTT 


TTCTTAATTT 


TTTAATTCTG 


CTAAGTCGTT 


TTGAGCAAGA 


GCTGCTTTTA 


300 


CATCAGCACG 


GTAAGTTGCT 


TTTTCTTCTT 


CTGTCCAGTC 


ATAGAATCGT 


CCCATTTCAT 


360 


CCAAAACTGG 


CTCAACGATA 


CTATCCAAGC 


TATCACGCAT 


AAAGAGCATG 


TGATTGGTAC 


420 


GACGAAGAAG 


GAAGTCAACT 


GGGCTAAGAG TCAACTCATT 

136 


GCGCATTGCA 


TAGTGAAGGG 


480 



WO 97/37026 PCT/US97/05306 

ACAAG 



485 



(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



{ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 



CTTAATTATA ACTTAAACAC AAAAGTTTTA CACAAACTGT GGATAACTCT TTTGAAACTG 60 

TGATTTTCTT AATTGAAATC TATTTTTTAT TTTGTGAATA AGATGTGAAA AAATAGAGAA 120 

TATGTTAGAA TAGAGTCATG AAAATTAAAG TTGTAACAGT TGGGAAACTG AAAGAAAAGT 180 

ATTTAAAAGA TGGTATCGCA GAGTATTCAA AACGAATTTC TAGATTTGCT AAGTTTGAAA 240 

TGATTGAGTT ATCAGATGAA AAAACACCAG ATAAGGCCAG TGAATCAGAA AATCAAAAGA 300 

TTTTAGAAAT AGAAGGTCAG AGAATTTTAT CAAAAATTGC TGACCGTGAT TTCGTTATTG 360 

TGTTAGCCAT TGAAGGGAAA ACTTTCTTCT CAGAAGAATT TAGTAAGCAG TTAGAAGAAA 420 

CTTCTATAAA AGGANTTTCT ACTCTTACTT TTATTATTGT SGGAAGTTTA GGATTGTCAT 480 

CATCTGTAAA AAATAGAGCC AATCTTTCTG TCAGTTTTGG TCGCCTAACC TTGCCTCATC 540 

AGTTAATGAG ACTAGTTCTT GTTGAACAAA TCTATCGCGC TTTTACGATT CAGCAGGGAT 600 

TCCCCTACCA TAAATAGAGA ATTGACTTTT AATTGAATTT TTGGTAGAAT AATTGTGTTA 660 
GGTCTCATAG 



670 



(2) INFORMATION FOR SEQ ID NO: 134: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



CTATAATTTT ATTAGCTGTA TTCCAGTCGT CACGACCAAA CTCTGTTACA GGGACACGAA 60 

TGTCAAAACG GTTCTCAATC TCCACAATCA ACTCAACCGT TCCCATACTA TCCAAGACAC 120 

CTGCATCAAA AAGATCTTCA TCCATCATGT CAGAAACATC TTCCATAAAC AACTCATCAA 180 

TAATTTCGAT AACTTCTGAT TTGATATCCA TATTTTATTT CCTTTTATTT TTTAAACCAT 240 

AGATTATTCA AGAATCCAGA AAAGATTAAG AATGACAACA TGACAACATG GAAAGTGACA 300 
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ACCATGCCAA GCAACTGAAT CCAGCGATTC TCAGGTAGGG CAGCCTTCCC TGCTTTTTTC 360 

CGTTCCTTAT TGAGCGTTTT TTTCTTGCGA ACCCAGGCAT CATTGATGAA CAAGCCTAAG 420 

TCCATGAAAG AGTCCCATAG GCGATATAGT ACCAGGTCAC ACCATGCCAA AATCCCCATA 480 

ATCCAGCATA TTTACAATGT AAGATC 506 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 5: 



ATGGTTCGAC 


CCACAAATAT 


GTTTTGCGAA 


TCATTGACCA 


TTCTATGATT 


TACGTTGCCA 


60 


TTGCCGGCTC 


ATACACGCCC 


GTTGTCTTGA 


CCTTGATGAA 


TAACTGGTTT 


GGCTATCTGA 


120 


TTATTGTCAT 


CCAATGGGGA 


ACGACCATCT 


TTGGTATTCT 


CTATAAAATC 


TTTGCTAAAA 


180 


AGGTCAATGA 


GAAATTTAGC 


CTTGCTCTTT 


ACCTGATTAT 


GGGCTGGTTG 


GTTCTGGCTA 


240 


TCATTCCTGC 


CATTATCAGT 


CAAACNACAC 


CCGTTTTCTG 


GATTCTCATG 


GTAACTGGCG 


300 


GACTCTGTTA 


TACAGTTGGA 


GCTGATTGAT 


AGCCTTTATG 


CATTGAAATT 


TGACAGTTTG 


360 


ACTTATGAAA 


ATAAAGCAGA 


GGTTATGGAC 


TTTATCAAGG 


CTCGTGTTGA 


TAAGATGATG 


420 


GGCTCTACTC 


CAAAAGATAT 


CAAGGGAAGC 


AGTTCTTGCA 


GGTTCAAACT 


TTGTTGTGGC 


480 


AGATATGTTT 


GGAAGCAGCA 


AGTGCTCTCG 


TAGAAGTAAG 


CAAGGAAGAA 


GATTTTAAAC 


540 


CATCTGTTGA 


ATCACTTTCT 


CGTGCCTTTA 


ACCTGGCTGA 


GAAGGCAGAA 


GGGGTTGCTA 


600 


CAGTTGATTC 


AGCACTATTT 


GAGAATGACC 


AAGAAAAAGC 


TTTGGCAGAA 


GCAGTAGAAA 


660 


CACTCGTTTT 


ATCAGGACCT 


GCAAGTCAGC 


AATTGAAACA 


ACTCTTTGCG 


CTTAGCCCAG 


720 


TCATTGATGC 


TTTCTTTGAA 


AATACTATGG 


TAATGGCTGA 


AGATCAGGCT 


GTCCGTCAAA 


780 


ATCGTTTGGC 


AATCTTGTCA 


CAACTAACCA 


AGAAAGCAGC 


TAAGTTTGCT 


TGTTTTAACC 


840 


AAATTAACAC 


TAAATAAAAT 


TTGATAAACG 


GACTTTATCT 


TATTACAAAG 


GAGAAGAAAT 


900 


GGATCCGAAN 


ANATTGCTCG 


T 








921 



(2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



138 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 : 

CTTGGCAACA GGCTCTTTTT GTATTGCGTT TATTAAGAAA AACAAAAAAC ATCGATATAA 
CATTTAACTA AAAATTATTG TATGTTATCT CCCTTAATTA GGAATGATAA GGGAATAACT 
AGAAAGATTT GTGAATACAA ACTATTTCTG ATATACAAAA TATACAGTAA TAATGAATGA 
TGGGAGATGG GATGAAAGAA TTTCAATTTG AGAGAAAGCA GCGTTTTTCT TTGAGGCCAT 
ATGCAATAGG AGCTTGTTCG GTCTTGCTAG GAACGAGTTT ATTTTTTGCT GGTATGGGTG 
CTCAGCCTGT ACAGGATACA GAAACGAGTT CAGCACTAAT TTCAAGTCAT TATTTGGATG 
AGCAGGATTT ATCTGAAAAG CTGAAATCTG AGTTGCAATG GTTTGAATTA GAAAACAAGC 
TTTTGAACTT ATGGGAACAT TAGGTTACTA TGAAGGATTT GTTCCTTATG TTTCAAATCA 
ATACAAAAAC CAAGCTGAAG AAGAAGGCAA ACCGCTATCT GATAAATATA TTTTCGAAAA 
ATCTTAAGAA AAACATATGC ACCTTCAAAA A 

(2) INFORMATION FOR SEQ ID NO: 13 7: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
571 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomi : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

GTATCAACTC CAAAGAGCAA GTGACTGTCA AAGTTGTAAC AGATGCGGCC AAGAAGCTCA 
TGGGTGATAA GATTGCTCGC CAGAAAGAAC GTGGGATTCA GATTGANACC TTGCGNACCA 
TGATTATNGG GATTCCANAC GNTGGTAAAT CCACTCTGAT GAACCGTTTG GCTGGTAAAA 
AGATTGCTGT TGTTGGAAAC AAGCCAGGGG TCACAAAAGG TCAACAATGG CTTAAAACCA 
ATAAAGATCT GGAAATACTT GGATACACCG GGGATTCTCT GGCCTATAGT TTGAGGATGA 
AACTGTTGCA CTTAAGTTGG CATTGACTGG AGCTATCAAG GATCAGTTGC TTCCTATGGA 
TGAGGTTACC ATTTTTGGTA TCAATTATTT CAAAGAACAT TATCCAGAAA AGCTGGCTGA 
ACGCTTCAAA CAAATGAAAA TTGAAGAAGA AGCGCCTGTG ATTATTATGG ATATGACCCG 
CGCCCTCGGT TTCCGTGATG ACTATGACCG TTTTTACAGT CTCTTCCGTG AAGGAAGTCC 

gL™ C ACTCGGTAAC TATACC " AG mGG * AGACCTCGAT GGCNACGATT 600 

611 

(2) INFORMATION FOR SEQ ID NO: 138: 

ii) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1054 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



TTGCTAAAGC 


AGCCATGATG 


ANGGCACTCC 


ATAAGGATAA 


GTACCACCAA 


GCAGCAGCAG 


60 


ANGACCATAA 


TCTCCTTTAT 


GACTTGAACG 


AGAACGTTCA 


ATAATAACTT 


TTTCTAGTAA 


120 


GGTTTGATTA 


ATCACTTTCA 


TCCTTTTTCC 


CTCTCACTTT 


TATTATACAA 


CAAAAAGGAG 


180 


ACGCAGACCT 


CCTTTTGTAA 


TCTTATATCT 


AAAATTTAAT 


ATTCATTTCT 


GCCATTTTAG 


240 


ATATAGCTAT 


AGAAAATACA 


CTCTATTAAT 


CGAATGTTTC 


TCTTATTTTC 


TATCCAATGT 


300 


CCGAAGTGCT 


GCTTGATAAG 


TTTGCTCCAT 


CAGCATGGTA 


ATGGTCATAG 


GACCGACACC 


360 


TCCAGGGACT 


GGCGTGATAT 


GGCTAGCAAG 


TGGTGCAACT 


GCCTCATAAT 


CAACATCTCC 


420 


ACAGAGCTTC 


CCATTTTCAT 


CTCGGTTCAT 


CCCAACGTCA 


ATGACAACCG 


CACCTGGTTT 


480 


GACAAAGTCA 


GCAGTCACAA 


ACTTGGCGCG 


GCCGATTGCG 


ACTACAAGAA 


TATCTGCTTT 


540 


AGCAGCCACC 


TTGGCAAGAT 


TATGAGTTCG 


TGAGTGGGCC 


AAGGTTACTG 


TCGCATTTTT 


600 


AGCCAAAAGA 


AGCTGAGCCA 


TAGGTTTTCC 


AACGATATTT 


GAACGACCGA 


TTACGACCGC 


660 


ATTTTTACCT 


TCCAAGTCAA 


TCCCATATTC 


ATGAAACATT 


TCCATAATTC 


CTGCAGGTGT 


720 


CGAGGGAATC 


ATGACTGGAT 


GTCCAGACCA 


AAGACGTCCC 


ATGTTTAGGG 


GATGGAAACC 


780 


ATCCACATCC 


TTTTCTGGGT 


CAATGGCTAA 


TAAAACCGCC 


TCTTCATCGA 


TATGTTTTGG 


840 


TAATGGCAAC 


TGGACCAAAA 


TCCCATGCCA 


AGCTGGATCC 


CTGATTATAT 


TTAGCAATCA 


900 


GGTCTAACAA 


TTCCTCTTGA 


GTTAATGGTC 


TCTGGAACTC 


GCACTACTTC 


GCTACGGGAA 


960 


CCAGCCGCAA 


GAGCTGACCT 


CTCCTTGTTG 


CGAACGTTAA 


ACTTGGCTGG 


CTGGATTATC 


1020 


CCCAACCAAA 


ATCACTACCA 


ACCAGGCACT 


AGAG 






1054 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



CTGAACTGGA 


TTTTGCTACA 


AGGTCTGATT 


TGTCAAAGAG 


ATGCGAGTTA 


TGATATGAAG 


60 


CAGGATGATT 


TGGATAAGGT 


AGCAGATTAT 


CTCTTCAAAA 


CAGAAGAATG 


GACCATGTAT 


120 


GAGTTGATTC 


TTTTCGGTAA 


CCTCTATAGT 


TTCTACGATG 


TAGACTATGT 


CACTCGGATT 


180 


GGTAGAGAAG 


TTATGGAGAG 


GGAGGAATTT 


TACCAAGAGA 


TTAGTCGCCA 


TAAGAGATTA 


240 


GTGTTGATTT 


TGGCCCTCAA 


TTGTTACCAG 


CATTGTTTAG 


AGCATTCTTC 


TTTTTATAAT 


300 


GCCAACTATT 


TTGAGGCTTA 


TACAGAGAAG 


ATTATTGACA 


AAGGTATTAA 


GCTTTATGAG 


360 


CGTAATGTTT 


TCCATTATTT AAAAGGTTTT 


GCCTTATATC 


AAAAAGGACA 


GTGTAAAGAA 


420 


GGCTGTAAGC 


AGATGCAAGA 


GACCATGCAT 


ATTTTTGATG 


TGTTAGGTCT 


TCCAGAGCAA 


480 
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GTTAGCCTAT TATCAGGAAC ACTACGAAAA ATTTCTCAAA AGTTAATTTT CCCAAATAAG 540 
GGAAAAAATA AAAAGCTCCT TTCGGTTTTG ATACAATAGT TTCAAAATTT GAGAGGAG 598 

(2) INFORMATION FOR SEQ ID NO: 140: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

CTACTTCCCG GCCGAGTCGA TTCTCCGCCG ATATCGAGCA TACTGGCTCC TTCTGTTATC 
AATTTACGAG CCTGCTGGAG CGCCTGCTCA AGAGCAAAAA A1TCACCACC GTCCGAAAAG 
GAGTCTGGGG TTACATTGAT AATTCCGCAA ATAGCTGTCT TTGCATGAGT GGCTTTACTG 
GACATATCGG TCACTCCCTC AAGGCTATTC ATCATATTAT TTCTCTATTT TACCATAAAA 
AGAAAAAGAT GGACACGATT TCATTCATCT TTCTCCCAGT AGAAACAAGT AAGCAATTGT 
CAATAATCTT AAACAGAAPT CCCTAATGTC CGACTCATAA TCACCACAAG AGCCAACAAA 
CAGAAAGCAA tcccatt: ,C AATCATGTGA AGTAAGATCG ACATTTCCAA ACGTTGGGTC 
TTGTAGGCTG TCCAAGATAG AACTGTCGAC ATACCTCCAT AAATCAATAA AGAAGGTAAA 
TTACTTGGTT GATGTAATAA AGCAAACACA ATCGTACCGA CTACAAAT 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
528 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 : 



CTATGAGAGA GAATGACCTT CTCTTGATTA CTGCGGACCA TGGAAATGAC CCAACGTATG 
CAGGAACGGA TCACACTCGG GAATATATTC CATTGTTGGC CTATAGCCCT 

TTGGTGTGGA AACTGCTATG ATTGGGGAAA GTTTCTTAGA TAAATTGGTA TAAGATGACG 
CGCTATGCTT TGCTGGTGAG AGGTATCAAT GTTGGTGGTA AGAATAAGGT C "caScG 
GAGCTTCGTC AAGAATTGAC AAACTTGGGA CTGGAAAAAG GTTGAGAGCT AcItC^AG 

tggcaatatt rrccrrTACT tcgatagatt ccaaagcccc aat^g^ aZgc"!^ 



60 
120 
180 
240 
300 
360 
420 
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GACTTTCTTT GCAGTCCATT ATCCATTTAT TCCAGAACTT TTTCCTTTAC TGAGTCTAGA 
GGACTTTGAA GGGCGGAACT TGAAA 



480 

505 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 



CTATTTTCTC 


ATCTTTTTTA 


GACTGGTAAT 


GTATCCCCTA 


GAGTTTCATT 


GTTAAGACCT 


60 


GTCGTATTGA 


CAACAGTGGA 


TCCTGATTCC 


CCATTCCAAG 


AGTTTTGCCC 


GCATCATATN 


120 


GACGAAATCT 


TTTTCTGAGC 


CAGCAATTTT 


CTCAGCTAGG 


GCAATAGCGG 


CGCTGTTGGC 


180 


ACTAGATACC 


AGAGTTGCTT 


CAAGCAACTC 


TTCGACAGTA 


TAATTACGGG 


CCTCCATAGG 


240 


AATATTACTG 


GCTTCAGAAT 


TTGTCGTCAA 


TTGATAAGGA 


TAATCAGAAA 


TATCTACAGG 


300 


AGTGGAGAGG 


GTAATACTTC 


CGTTTTCCAA 


AGCTTCATAG 


ACCAGATAAA 


CAGTAATCAA 


360 


TTTTGTTATG 


GAAGCAATTT 


CGACAGGTGG 


CGTTGCATCC 


TTCTCATAGA 


GAATTTTACC 


420 


AGTATTTGCC 


TCAACAGCAA 


TCGCATGTTT 


AGCGGCAATG 


GTAAAATCTT 


GAGCAACAGC 


480 


AGTAGAAGCA 


CCCCCTAAAA 


GAGAGACAGT 


TAACAAAGTT 


AAAAATATTT 


TTTTCATAGT 


540 


AGTCTTATTC 


TATCATAAAG 


AAAAAAAATA 


TTCTTGCTTT 


AATAATTCAT 


CTGTTAAGCT 


600 


TTTTGAAAAT 


ATGGTAAAAT 


AAAGTGAGGG 


AGGTAACTCA 


TGTTTCGTAG 


AAATAAATTA 


660 


TTTTTTTGGA 


CCACAGAAAT 


TTTACTCTTA 


ACCATCATCT 


TTTACCTATG 


GAGACAGATG 


720 


GGGTCTTTGA 


TTAACCCTTT 


TGTTAGCGTG 


CTTAATACAA 


TTATGATTCC 


ATTTTTATTA 


780 


GGGGGCTTTC 


TTTATTATTT 


GACAAACCCT 


ATTGTTACTT 


TCTTAAATAA 


AGTCTGTAAA 


840 


CTCAATCGTT 


TGCTTGGTAT 


TTTAATTACC 


TTGTGT 






876 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
CTTGTCAAAT GCCATGGAAG GGGTTCTTTA CTTCCTCAAA CCAGACTTTT CAAAACTGAC 60 
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TAGCGCTGAT CTCCTCTATG CTCTGGGACA ATCTTTCTTT GCCCTCTCAC TAGGGGTTAC 120 

AGACATGTTG ACCTATGCTT CTTACTTGGA CAAGAAAACC AATCTAGTCC AGTCAGGAAT 180 

CTCCATCGTA ACCATGAATA TCTCGATAGT CCATCATGGA AGGTCTAGCC ATTTTCCCAG 240 

CCATGTCAGC CTTCAATATC CACTCTGAAA GGGGACCCAG CCTGCTCTTT ATCGTCTTGC 300 

CTCAACTCTT TGACAAGATG CCTTTTGGAA CCATTTGCTA CGTCCTCTTC CTCTTTGCAA 360 

CTGTCACTTC TTCTGTCGGT GATGCTGGAG ATCAATGTGG GCAATGTCAC CAACCAGGAT 420 

AACAGCAAAC GTGCCAAATG GAGTGTTATT TTAGGAATTT TGACCTTTGT CTTTGGCATT 480 

CCTTCAGCCC TATCTTACGG TGTCATGGCG GATGTTCACA T 521 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 



r 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



143 



60 



3CAGACAA TAACTTGGTT AGCTTAACGG CTCTTGAGGA TTCATCCAAG GATGTAACCT 

TTACCAGTTC GGCTTTCAAT CTAAAAGAAG GGCGACACCT TCAAAAAGGG GATTCCAAGA 120 

AAATCCTTAT CCACGAAGAA TTGGCTAAGA AGAACGGTCT TTCACTTCAT GACAAGATTG 180 

GCTTGGATGC TGGTCAGTCT GAATCTGGAA AAGGACAAAC AGTAGAGTTT GAGATTATCG 240 

GCATCTTTCC TGGTAAAAAA CAAGAGAAAT TCACAGGCTT GTCTTCTGAC TTCAGTGAAA 300 

ATCAAGTCTT TACAGATTAT GAAAGTAGCC AAACCCTTTT GGGCAATAGT GAAGCTCAAG 360 

TCAGTGCAGC ACGCTTCTAT GTTAGAAAAT CCTAAGGAAA TGGACGGACT CATGAAGCAG 420 

GTAGAAAACT TGGCCTTGGA AAATCAAGGC TACCAAGTCG AAAAGGAAAA CAAGGCTTTT 480 

GAACAAATCA AAGACTCAGT TGCAACTTTC CAAACCTTCC TGACCATCTT CCTTTATGGA 540 
TGTTGATAGC AGA 
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CTATGGATGC GGAAGTTTCT AAGAACCTTC GCTTGATCCT TGAGCGTAAA GGAATGACCA 60 

TCTTGACTGG TACTAAACTG CAAGAAATCA TTGAGGAAAA TGGTCAACTT CGTATCAAGG 120 

TTGAAGGAAA AGACAATATC ATCGCAAGCA AAGCTCTTCT TTCAATTGGT CGTATGCCAG 180 

ACCTTGAAGG TATTGGAGAG GTTGAGTTTG AATTGGATCG TGGTTGTATC AAGGTCAATG 240 

AATACATGGA AACTTCAGTT CCACGCATTT ATGCCACCAG GTGACATCAA CGGTACTAAG 300 

ATGTTGGCTC ACGCAGCTTT CCGCATGGGT GAAGTTTCCG CTGAAAATGC CCTTAAAGGA 360 

AATCATGCAG TTGCCAAATT GAATTTGACT CCTGCAGCCA TCTACACTCT CCCTGAAGTA 420 

GCAGCAGTAG GTTTGACAGA AGAACAAGCC CGTGAGAAAT ACGATGTTGC CATCGGTAAG 480 

TTTAACTTTG CTGCTAACGG TCGTGCTATT GCATCTGACG CAG 523 

(2} INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



CTGCAGGTGG 


AATTATGAAT 


GTCGGATATG 


AAAAAGCATT 


CTTGATGCAG 


ACATCGTTAA 


60 


ATTTGCCAAC 


TTCTGAAATT 


ATCTCGACAT 


ATGTCTATAA 


AGTTGGTCTT 


GTATCAGGAG 


120 


ACTATTCTTA 


CTCAACAGCG 


GTTGGTTTGT 


TTAATGCAGT 


GATTAACGTA 


GTATTGCTTG 


180 


TTGCAGTTAA 


CCAAATCGTT 


AAACGCATGA 


ATAATGGTGA 


AGGAATTTAA 


GGAGGAAAGT 


240 


ATGAAAAATT 


CGATTATGGA 


TACAAAATTT 


GATAGACGTA 


TCTTACTCTT 


AAATAAAATC 


300 


ATTATTGTCT 


TTATCGTTTT 


GATGACTTTG 


CTTCCTTTAC 


TTTATATCGT 


CGTAGCATCC 


360 


TTTATGGATC 


CTAAGGTTCT 


GGTTAGTAGA 


GGGATTAGCT 


TTAATCCAGC 


CGATTGGACT 


420 


GTAGAAGGTT 


ACCAGCGTGT 


TATTCAGTGA 


CCAATCTATT 


CTAAGAAGTT 


TTATCAATTC 


480 


CCTACTATAC 


TCTTTTGGAT 


TTGCAGCT 








508 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
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CTAAAAAGAA CCAGATGGAT CTTGTGATGA ATTTACATCA TTTTGATTTA CCAGTGGAAC 60 

TTCTTCAAAA ATACGGTGGT TGGGAAAGCA AACATGTAGT GGAGTTATTC GTGAAGTTTG 120 

CCAAGACTGC TTTCAACATG CTTTGGAGAT AAGGTTCATT ACTGGACAAC TTTCAATGAG 180 

CCAATGGTCA TTCCAGAAGC AGGATACTTA TATGCTTTCC ATTATCCAAA TCTAAAAGGA 240 

AAGGGAAAAG AGGCCGTACA AGTCATCTAT AATCTAAACC TTGCTAGTGC AAAAGTGATT 300 

CAACTATATC GCTCATTAGG ACTTGATGGA AAGATTGGGA TTATTTTAAA CTTGACACCT 360 

GCTTATCCAA GAAGTAATTC TCCAGAAGAC TTAGAAGCAA GTCGATTTAC AGATGACTTC 420 

TTTAACAAAG TCTTCCTTGA ATCCAGCTGT TAAAGGAACT TTCCCAGAAA AGATTGGTAA 480 

AAACAGCTAG AGAGAGATGG CGTGTTATGG AGTCATACCG AAAAAGAG 528 

(2) INFORMATION FOR SEQ ID NO: 148: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 8: 



ACTCTTATTG GTGAGGTATC GCAACTTCTG TTTCTGCCTT CCTTGGTGGA CCAGCCAATA 60 
CAACTTACGG AGAAAATACA GGGGTTATCG GTATGACTCG TATCGCTTCT GTCTCAGTTA 120 
TCCGTAACGC TGCCTTCATC GCGATTGCCC TCAGCTTCCT TGGTAAATTC ACTGCCTTGA 180 
TTTCAACTAT TCCAAACGCT GTACTTGGTG GTATGTCAAT CCTTCTCTAT GGGGTTATCG 240 
CCAGCAATGG TTTGAAAGTC TTGATTAAAG AACGTGTGGA TTTCGCTCAA ATGCGAAACC 300 
TCATCATCGC AAGTGCTATG TTGGTTCTTG GACTTGGGAG GAGCTATCCT TAAACTTGGT 360 
CCAGTTACAC TTTCAGGTAC TGCCCTTTCA GCCATGACAG GAATCATCTT GAACTTGATC 420 
TTGCCATACG AAAATAAAGA CTAAGAGTCT AAATACACCT AATCCACTCA GACAGCTGAG 480 
TGGATTTTTC GTATACCATA ATAAAAGTGT CTTAACAAAA TTATTAAAAT CAAAAAACGT 
ATAATATCAG ATATTCTAAA ACCTTGATAC TGTACGTTTT ATCATAGAAA TTTTTACTTT 
ATTTTCTCAT CAAATGAGAT TTGCATCAAT CTCTTGTCTT ACTTGCGTTT CTTCTTCGCT 
TTCTTCATTT TGTTAGCCAT ACGTTTCATG GACTGTTTCA TGGCAAATTC ACCAATTTTA 
CCTTTCAAAC CGCCACCAAA CATCTGGCTC ATATCTGGCA TTCCTGCTCC TCCGAGAG 



540 
600 
660 
720 
778 



(2) INFORMATION FOR SEQ ID NO: 149: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: G nomic DNA 



PCT/US97/05306 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: ; 


149: 






GGGAGTTGGT 






GTTTGAACTT 


TGAAATCGAA 


GTGGATGGTG 


60 


GGATTGATGA 






AAGAAGCTGG 


TGCGACTGTT 


TTTGTAGCAG 


120 


GTTCCTATGT 


CTT TAAGGG A 


GAAGTCAATG 


AGCGAGTACA 


AACTCTCAGA 


AAACAACTGG 


180 


ACTAGGGTTG 


CAGTTTTTGC 


AGGCGGAAAC 


CGCGGTCATT 


ATCGGACAGA 


TTTTGATGCT 


240 


TTTGTTGGGG 


TGGATCGAGG 


CTCGCTCTGG 


GTCTTGGAAG 


AAGACTTACC 


TCTTGCTCTA 


300 


GCAGTCGGAG 


ATTTTGATTC 


TGTGACGGAA 


GAAGAGCGAC 


AGGTGATTCA 


AAAACGTGCC 


360 


CAGTATTTTG 


TCCAAGCCCG 


GCCAGAAAAA 


GATGATACTG 


ATTTGGAATT 


GGCTCTCTTA 


420 


ACCATCTTTG 


AACAAAATCC 


TCAGGCTGAG 


GTCACTATTT 


TCGGTGCCTT 


GGGTGGCCGT 


480 


ATTGACCATA 


TGTTGGCCAA 


TGTTCTTTCT 


ACCTAGCAAT 


CCCTAAGTTG 


GCACCCTATA 


540 


TGCGTTCNAA 


TAGAAATTGA 


GGAATGGGCA 


AATTTGATTG 


CCTATTGTTC 


CAGAAGGGAA 


600 


CAGTCAGATT 


CGAATCCCTG 


CA 








622 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



CTATTATCAG 


CACTTTTATA 


CCTATTACCA 


AAAAAATCCT 


CAAAATGATT 


GGCAAAAGGC 


60 


TTTATCCACC 


AGCCATTTAT 


CAGCAATATT 


TACTTGAAAA 


ATATGGTAGA 


AATAGAAAGG 


120 


ATGGAGGAAT 


CTAATGGTAT 


TACAAAGAAA 


TGAAATAAAT 


GAAAAAGATA 


CATGGGATCT 


180 


ATCAACGATC 


TACCCAACTG 


ACCAGGCTTG 


GGAAGAAGCC 


TTAAAAGATT 


TAACAGAACA 


240 


ATTGGAGACA 


GTAGCCCAGT 


ATGAAGGCCA 


TCTCTTGGAT 


AGTGCGGATA 


ACCTACTAGT 


300 


AAATCACTGA 


ATTTTCTCTT 


GAAATGGAAC 


GCCAGATGGA 


GAAGCTTTAC 


GTTTATGCTC 


360 


ATATGAAGAA 


TGACCAGGAT 


ACACGTGTAA 


GCTAAGTATC 


AAGAGTACTA 


TGCCAAGGCC 


420 


ATGACACTCC 


TACAGCCAGT 


TAGACCAAGC 


CTTTTCATTC 


TATGATCCTG 


AATTTATGGA 


480 


TATTAGCGAA 


AAGCAGT 










497 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
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CTACACTTGA GGGAAGTTTG AAAATTCCAT TTTTCTTAAA GGGAATTGTG AAACGCTATG 
TATTTTCTTT TTACAACCGG ATGGAGCACT TGGTTGTGGT CAATCCTATG TTTATTGAGG 
ATTTGGTAGC AGCTGGTATT CCACGTGAAA AAGTGACCTA TATTCCTAAC TTTGTCAACA 
AGGGAAAAAT GGCATCCTCT ACCACAAGAA GAGGTAGTCA GACTGCGCAC AGATCTTGGT 
CTTAGTGACA ATCAGTTTAT CGTAGTAGGT GCTGGGCAAG TTCAGAAACG TAAAGGGATT 
GATGACTTTA TCCGTCTGGC TGAGGAATTG CCTCAGATTA CCTTTATCTG GGCTGGTGGC 
TTCTCTTTTG GTGGTATGAC AGATGGTTAT GAACACTATA AGAAAATTAT GGAAAATCCC 
CCTAAAAATT TGATTTTTCC AGGCATTGTA TCGCCAGAGC GGATGCGCGA AATTGTATGC 
TCTAACGGAT CTTTTCTTGT TGCCTAATTA CAATGAG 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 798 base pairs 

147 
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<ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CTGTTATGGA GTTGAAAGAA GAATACCACC CAGACGGTAA AGGTTTTGAA ATGATCTGTA 
TCGGTGGGAT GGGAGCTGAT TTCTTTAAGG CTCGCGGTAT TCAACCACTT TATGAATTAC 
GTGGCTTGTC AGACCAACCT AGCTTTGATC AAGTTCGTAA GATTATTTCA AAAACTGTTG 
AAATGTACCA AAATGAACTC TTTGATGAGC TTTATGTTTG CTACAACCAC CATCTCAATA 
CGCTAACCAG TCNAATGCGT GTGGAACAAA TGCTTCCGAT TGTTGACTTG GATCCAAATG 
AAGCGGATGA AGANTACAGC TTGACTTTTG AAATTGGGAA ACCAGCCGAG AAGAAATTCT 
GGAGCAGTTG TTGCCTCAGT TTGCAGAAAG TATGATTTAC GGTGCCATTA TCGATGCCAA 
GACAGCTGAG AATGCTGCGG GTATGACAGC CATGCAAACA GCGACAGATA ATGCTAAGAA „ y 
AGTCATCAAT GATTTGACAA TTCAGTATAA CCGTGCCCAG ACAGGCGGAT ATTACACAAG 540 
AAATTACCGA AATCGTAGCA GTGCCTAATG CCTTAGAATA GCTCTAGTCC CAGCTCTCCT 600 
CACTC 

605 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



60 
120 
180 
240 
300 
360 
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(B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



TGTTCGCCTC 


CATATCCNAA 


ACTTGCCCTG 


CCACCATACC 


CAGACTACCT 


GAAGCAAGGG 


60 


ATAAGTTGGC 


AATCGAGCCC 


ACCTNGATCT 


GACTTGGCGA 


ATCTGCCTGC 


GCAATCAAGG 


120 


CATATGGGTC 


TAAGAATAAG 


GATCTCCAGC 


CAAAATGGCC 


ATAGCTTCAC 


CGAATTTCTN 


180 


GTGATTGGTT 


AACCGCCCTC 


TTCGATAATC 


GTCATCATCC 


ATAGCAGGAA 


GGTCATCGTG 


240 


AATCAAGCTC 


CCTGTATGAA 


TCATCTCCAA 


GGCAGTAGCT 


ACCTGCGCGN 


GAGCAGGTTT 


300 


GATGGTAACC 


TGCAAGGCTT 


CCAGAACTTC 


TAACAAGAGA 


AAAGGCCGAA 


TACGCTTGCC 


360 


ACCAGCATGA 


ATAGAATAGA 


GAACAGACTC 


CCGTAAACTA 


GAGGCAAACT 


QCTGGTCTCC 


420 


ATAAAATCTT 


CCAAAGCCGA 


CTCGACAAGA 


GCTAATTTTT 


CTTGCTTTTT 


CATTCAAAAT 


460 


CACTTTCTGT 


TCCGTCTTCT 


TGCATGACCT 


TGACCAAGGT 


CTTTTCAGCC 


TTGTCCAGCG 


540 


TAGCTTGGAG 


CTCTTTTGAC 


AAGACCATGC 


CCTTTTGAAA 


GGCAGTAATC 


GCATCTTCCA 


600 


GAGCAATTTC 


ACCATTTTCC 


AAACTTTGGA 


CAATGGTTTC 


CAGTTCTGCT 


AGATTTTCCT 


660 


CAAATTTCTT 


TTGTTTTGAC 


ATCTTTAACC 


TCTAATTCTA 


CTTGACCATC 


TCGCATCAAA 


720 


AGCGTTACTT 


GGTCTTTTTT 


CTTC? iACTC 


TCAACCGAAT 


CTACAACGGA 


TCTTCTTTTT 


780 


TGACAATAGC 


ATAACCAC 










798 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



CCATGTTTGC 


GGCAGTGACA 


GCAGGTTATC 


AGGCTGCCCT 


AATGGTACCA 


ACAGAAATCC 


60 


TCGCAGAGCA 


ACACTTTGAG 


AGTTTACAGA 


ACCTTTTTCC 


CAATTTGAAA 


CTGGCTCTCT 


120 


TGACAGGTTC 


CTTGAAAGCT 


GCAGAAAAGA 


GAGAAGTCTT 


GGAGACCATT 


GCCAAGGGTG 


180 


AGGCTGATTT 


GATTATAGGA 


ACTCACGCTC 


TGATACAAGA 


TGGGGTGGAG 


TATGCTCGTC 


240 


TTGGTTTGAT 


TATTATCGAT 


GAGCAGCACC 


GTTTTGGTGT 


AGGGCAAAGG 


CGTATTTTAC 


300 


GGGAAAAAGG 


CGACAATCCA 


GATGTCCTCA 


TGATGACGGC 


GACTCCCATT 


CCACGGACGC 


360 


TTGCCATCAC 


AGCCTTTGGA GATATGGATG 


TTTCCATTAT 


CGACCAGATG 


CCAGCAGGTC 


420 


GGAAGCCCTA 


TTGTGACGCG 


CTGGATCAAA 


CATGAGCAAC 


TACCTCAGGT 


CTTGACTTGG 


480 


TTAGAGGGGG 


AAATTCAAAA 


AGGTTCCCAA 


GTCTATGTCA 


TCTCTCCTTT 


GATTGAAGAA 


540 



148 



WO 97/37026 

TCAGAAG 



PCT/US97/05306 



547 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CTCTTTTAAC CGTTTTAGCG GTGACACCGA GGTATTTTTT CAGGACCCAA GACTTGTCGG 60 

GCAACCGAAA CTGGGAGTTC GTCATCTCCA ATATGCAGAC CAGCAGCATC AACCGCAAGA 120 

CAAACATCCA ACCGATCATC GATTATCAAG GGGACCTGAT AGGCATCTGT TATTTCCTTG 180 

ACTTGTTTTG CCAGTTGATA ATATTGATTG GTTGTGAGAT TTTTTTCTCG CAATTGGACT 24 0 

ATGGTAACCC CTGAACGGCA GGCCGTCTCA ACTTTTGCAA GAAAGCTTTC CACGGAATCT 300 

TGATAGCGAT TGGTTACCAG ATATAGTCTA AGCGCTTCTC TATTCATAAA CCTCTCCTTT 360 

GATGGTATCT AGCCAATTTT CATCTCTTCT TAGGAGCGAA AGCTGATTGA GTACTTGGTA 420 

ACGAAATTCT TCCAATCCCA TTCCTTGAAC AACTATTTTC TCAGCCAGCG ATATTGAGAT 48' 

AAGAGACTGC TAAGCAAGAA CTTCAAAACC AGTCTTTCCT TGGCTGAGAA AAACAG 536 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 : 

CTGGAGTGAC GATTGAGAAA ATCTTTGCCC TAGAAAATTA TCGAAATCAG TTAGCTGCTT 60 

TTCCGCAAAC TGTCTGGATT TCAGAGGATA TTTTGCTAGA TTTGGCGGAT TCTCAGACTC 120 

CACAGGGAAT TGTTGCCGTG GTTCAAAAAG AAGAAGTAGG ACAAGCTGAT TTGAGTCAGG 180 

GCAAGTTCTT GTTTTTGGAA GATGTGCAAG ATCCTGGTAA TGTAGGAACT ATCATTCGAA 240 

CTGCGGATGC AGCAGGTTTT ACAGGAGTGA TTGTTTCAGA TAAGTCGGCA GACATCTACA 300 

GTCTTCAAAA CCCTACGTTC CATGCAAGGT AGTCATTTCC ATCTGCCCAT TTACCGGATG 360 

ACTAGTCAAG CGCTTCTTGA CGAAACTAAA AAGGTAGCTA TCCCAGTGCT AGCAACAACC 420 

CAATCTAAAG ATTCTGTTGA TTACAGAGAA CTGCCTTCTA TAGAAAATTT TGTACTAGTT 480 
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ATGGGAAATG AGGGTCAAGG AATTAGTCCC CTTATGGCTG AAAGTGCAGA CCAGTTGGTC 540 

CATATTAGCA TGAAGGGGCA GGCCGAGAGT TTGAATGTTG CGGTTGCAGC CGGTATTTTA 600 

ATCTTCCATT TAAGCTAATT TTAACTTTTT TTGTTATAAT CAAGGAAAGA TGTTCACAGA 660 

AAAGGAGAAA TGGATGAATC ACACTATTAT ACATGACCGT GCAGGTCTCA ATCAATTTTA 720 

CGCTAAGGTT TATGCCTTTG TTGGTCTGGG AATCGGACTA TCTGCTTTGG TATCAGGCCT 780 

TATGTTGACG GTCTTTCAGT CTCAGTTGGT TTACTTTTTG ATGCAGGGGC GTCTCTGGTT 840 

GACCATTGCT ACTT 854 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 157: 



CTGAAGAAAT 


CCTAP \AGAT 


TTGTATTACG 


GAGCCGCTAA 


GAAAATTCAA 


GAAGAAGCCT 


60 


CTCGCTGGGC 


agg; ",TTGTA 


AGAAATGACT 


AAAACAGCCT 


TTTTATTTGC 


TGGTCAAGGT 


120 


GCCCAGTATC 


TACGGATGGG 


ACGGGATTTC 


TATGATCAGT 


ATCCGATTGT 


CAAAGAAACG 


180 


ATTGATCGAG 


CGAGTCAGGT 


GCTCGGTTAT 


GATTTACGTT 


ATCTCATCGA 


TACGGAAGAA 


240 


GACAAACTCA 


ATCAGACCCG 


CTATACGCAA 


CCAGCCATTC 


TAGCGACTTC 


GGTTGCTATC 


300 


TACCGTTTAT 


TGCAAGAAAA 


GGGCTATCAC 


CCTGATATGG 


TTGCTGGTTT 


GTCTCTTGGA 


360 


GAATACTCTG 


CCTTGGTGGC 


AAGCGGCGCC 


TTGGATTTTG 


AAGATGCCGT 


TGCCTTGGTA 


420 


GCTAAGCGTG 


GAGCCTATAT 


GGAAGAAGCG 


GCTCCTGCTG 


ACTCTGGCAA 


GATGGTAGCA 


480 


GTTCTCCATA 


CCCCGTAGAG 


TCCTTGAGAA 


GCCTGTCCAA 


AGCTCTGACT 


TGAGTGGTAC 


540 


TCCAGCCACT 


ATACACACTG 


CACAATCTCA 


TTGCTGGAAA 


ATTGTTGCAG 


TGATCAACGG 


600 


TGACTTTGCA 


GAGCAGTGCC 


AACCCTATCC 


CCTAGT 






636 



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



150 



WO 97/37026 PCTVUS97/05306 

CTCTAAAACG AATTCTCTAC AAGCACCGCA AGGGCATGGC TGAACTTCCA CCATAAGGTG 60 

GTTTGTCTCG AAAGGCTAAT ACTTCNTTAA CCTTAGTTTG TCCTGAAAAT TGGTACATAT 120 

TGAAGAGGGC CGCCCGTTCT GCGCAGAGAT GGAAAACACC ACAGGTTCCC TCCATACAGA 180 

ATCCTGTAAA TATTTGTCCA TCTCCTGCTT CTACTGCAGC TACAACATGA TTGGCATAAA 240 

CAAAGTCTGA TACTTCATGT GGATTGTATA GTTTCTGTGC TTCTTCCTAC ATCTTTTCCC 300 

AGATGTCCAT TATTGTATCC TCTATATTTA GAGATTTCTT TTAGAATGTT TTCGATATGC 360 

TGAATTGATT TTTCACGTCC AAGCAAGAAA ATTGTATCTG GTAATTCTGG CCCATGCATT 420 

TCGCCTGAAA CTGCGATACG AATAGGCATG AAAAGATTTT TCCCTTTAAT ACCTGTTTCT 480 

TTTTGGACTG CTTTAATTTG CGGGAAGATA TTTTCTGTCA CAAATTCATC ATCTGTCATC 540 

CGCTTCAATT TTTGCTTTGA ATGCTTCAAG AACTGTTGGA ACTGTTTCAC CCGTCATGAC 600 

TTCGCGCTCT GCTTCTGTCA ATTCCTGGGA AATCTGAGAA GAAAGATCTG TCCATGGGAT 660 

ATCTCATCTA CTGATTCATT GTGGTTTATA GAG 693 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CTGACAAAGG AGACGGGTAT GGAACAAACA TTCTTTATCA TCAAACCAGA TGGTGTAAAA 60 

AGAGGGCTAG TGGGTGAAGT GTTAAAGCGC ATCGAACAAC GTGGATTTAC AATCGAAAAA 120 

TTGGAGTTTC GTTCACAGGT TTTCAGAAGA GTAGATTGAC CAGCACTATC AGGACCTGGT 180 

TGGTCAGAGT TTTTACCCAC CGATTCGTGA ATTCATGACT TCAGGTCCAG TTCTTGTGGG 240 

TGTCATTTCT GGTCCCAAAG TAATCGAAAC TTGGCGGACC ATGATGGGTG CAACTCGTCC 300 

AGAAGAAGCT TTACCAGGCA CTATTCGAGG TGATTTTGCA AAAGCTGCTG GAGAAAATGA 3 60 

GATTATCCAA AATGTTGTAC ATGGTTCAGA TTCCGAGAAG AATCAGCTAA GCCGAGAAAT 420 

TGCTCCTTTG GTTTTAAGAG TGGATTGGCT CAATCAATTG GATAAAAGCT CATTTGAATA 480 

GAAAGTATAG TCAATTAGTT TAAGACATGA CGCATGATAT CAAACTTTTT AGTTTTTGAT 540 

ATGGTGCGTT TTT 553 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



151 



WO 97/37026 

(ii) MOLECULE TYPE: Genomic 



DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



CTAGAGATTT 


NCGAAGAGTA 


TTATTGTACT 


TTAAAGGTCT 


TGAGATAATT GTCTTTNCCT 


60 


ACTTGACCTT 


CGAAGGTTTT 


ACCATTTTCA 


AGGTAAGGAA GGTCATCGGA TACTGAAGCC 


120 


TTGACCTTGT 


ATATCTTGCC 


ATCAACTTTA 


AAGAAGTAGA 


CAGTGTCTCC CTTGATAACA 


180 


GCTGATTTGA 


GGTCTGCTAC 


TACTCCCTTG 


ATGCTTTCTG 


TCGTTGCATT GTCAATTTCA 


240 


AGGTCGTTTT 


TATTGGCATA 


CTTGCTGAGC 


ATCTCTTCCA 


CTGTAGTAGC AACGATAACA 


300 


TTTTGGTACT 


CGACTGCGTC 


TACCAGGGCG 


TACTCTTTGA 


CCAAGCCAGC ATTGTCCTTT 


360 


AAGCCCATGA 


TGTAAAGAGG 


CTTGTCATTG 


AGGTTGATAA 


AGATTGGGGA AAGGTTGCTT 


420 


TGTAGGATTT 


CTCCTGAACA 


GCACCTTCTG 


CTGATTCACG 


GGCTGATTCT TCTGTCGCAG 


480 


AAGCCAAG 










488 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



TACAGCTATC 


GGGTCCGTCT 


GGTAGTTGGT 


CCTCAGAACC 


TTCACTGCCA 


CTTCTTCCCC 


60 


ATCTAAGATT 


AAGTCTTTGG 


CTAGGTAGAC 


ATCCGCCATG 


CCTCCTCGAC 


CAATCTGTTT 


120 


GACAATCCGA 


TAGCGTCCGG 


CAAAAATCTT 


GCCGATTTGG 


ATCATTCTTC 


ATCCTCCTCG 


180 


TTCATAGAAA 


CAAGGGCAAC 


CGTAATGTTG 


TCTAAACCTC 


CTGCATTGTT 


AGCAAAACGA 


240 


ACAAGTGTCT 


CCGTTTTATC 


TGCTAAAGGA 


ATATCACTGG 


TTACAATATC 


ACGAATCTCA 


300 


CTGCCTGAAA 


TCATGTTGGT 


CAAGCCGTCA 


CTATTGAGCA 


AGAGATAGTC 


ACCTGACTCA 


360 


AGGATAACTG 


TCCCAAAATC 


AGGCTGAATT 


TCATCTTTTT 


GCCCAATAGA 


CTGGGTGATA 


420 


ATATTTTTTT 


GTGGATGACT 


TCTGCCTCTT 


CTGGTGTCAA 


TTGACCAGCC 


TTGAGCAATT 


480 


CATTAACCAA 


AGAATGATCG 


CTCGTCAACT 


GATGGTATCT 


TCTCCACGAA 


TCAAGCCGAT 


540 


ACG 












543 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

CTTTAAGGAA AATCAAATCT CTCATGCTGA TACCTCTCCT CATTAAATTA AATAGTAAAA 
AAGATTCTAT CTCACTCCCT GATTATTACA AAACCATTGA AATATCACAA CTAATAGGCT 
AGAATGGACA TAGTAAGATN TAGTAGATGA GTCATTCTAC TCAAATCCAC GTTAGAAAGG 
ACTGCTATGC CAGACAATCT CGCGCTTCGC ATGCGCCCTA AAACCATCGA CCAGGTCATC 
GGTCAGGAGC NTCTGGTCGG ACCTGGAAAA ATCATCCGCC GCATGGTGGA AGCCAACCGC 
CTGTCCTCCA TGATTCTATA TGGCCCTCCT GGAATCGGCA AAACCAGTAT TCCCTCTGCC 
ATCGCTGGAA CGACCAAGTA TGCCTTTCGA ACTTTCAATG CGACAGTTGA TAGTTAAAAA 
GCGACTGCAA GAAATCTCGG AAGAAGCTAA ATTTNCTGGT GGTCTCGTCC TATTGCTAGA 
CGANATTCAT CGACTAGATA GACC 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

CTAATAGAGG CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA 
TTTTACCAAC CTCTTCITCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC 
AGGAAACCTT GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA 
CCAAGAGAAA TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT 
TGATTTTGCT TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA 
GGAATTATTG CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC 
CAGGAGCAAG TTGTTTTGGG GAATACCTAT CATACAGGAA ACCATTGCTT CTCATgZg 
^GTTCTTGT TAAAAAGTGG AATTGGTAAG GTCA TCTCTC CTA^AATT* ^TGGCGAAT 
TTTGGCTGAT CATTTCCAGG TTGGATGCCC TTATTAATAC GGGTTCAG 

(2) INFORMATION FOR SEQ ID NO: 164 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



CTAATAAGAC 


TGAAAAGAAG 


TCCAGCCAGA 


TGTCCCTAAA 


AATACAGAAA 


AAACATTAAA 


60 


ACCAAAGGAA 


ATCAAATTTA 


ATTCTTGGGA 


AGAATTGTTA 


AAATGGGAAC 


CAGGTGCTCG 


120 


TGAAGATGAT 


GCTATTAACC 


GCGGATCTGT 


TGTCCTCGCT 


TCACGTCGGA 


CGGTCATTTA 


180 


GTCAATGAAA 


AAGCTAGCAA 


GGAAGCAAAA 


GTTCAAGCCT 


TATCAAACAC 


CAATTCTAAA 


240 


GCAAAAGACC 


ATGCTTCTGT 


TGGTGGAGAA 


GAGTTCAAGG 


CCTATGCTTT 


TGACTATTGG 


300 


CAATATCTAG 


ATTCAATGGT 


CTTCTGGGAA 


GGTCTCGTAC 


CAACTCCTGA 


CGTTATTGAT 


360 


GCAGGTCACG 


TAACGGGGTT 


CCTGTATACG 


GTACACTCTT 


CTTCAACTGG 


TCTAATAGTA 


420 


TTGCAGATCA 


AGAAAGATTT 


GCTGAAGCTT 


TGAAGCAAGA 


CGCAGATGGT 


AG 


472 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 



CTTTTAAATA 


AGAAGAATCA 


CACTCATTGC 


TAAATGTGAT 


ATAATATTGT 


TCTCAGTCTC 


60 


AGCAGTGCGT 


TGTGAAAAAG 


CCATACTAAT 


CCCTGATTTT 


TCCGCATATT 


CCATCACCTC 


120 


AAGTGTTTTA 


CTCACTGAGC 


CAACTTGATT 


AGGCTTTATT 


ACAATACCAT 


CAGCACAATC 


180 


TTTAAACTGT 


GAAATTCTTT 


CTAAGTTTGT 


TGCGTAAAAA 


TCATCACCAA 


AAACTTGTAA 


240 


CTTTAGAGGT 


TTTATCAATT 


GAAATTTTCT 


CCAACTATCC 


AAATCTTCAT 


CGGAAAATGG 


300 


ATCTTCAAGA 


TAAACCAATG 


GATACTTAAC 


TCCCCAATCA 


CAATAAGTAT 


CCATAATTTC 


360 


TGTGACTGTC 


TGTTGCTGAC 


TAACACACCA 


TGGTACTTTA 


TATATGCCAA 


GTGAAGAATC 


420 


GTAACGATCT 


GTCATTGCTA 


AATCTAAACC 


GATATCATAT 


CTATTTTTGT 


ATAATTTTAA 


480 


TGTCTCTAAT 


AACGAATCCA 


AAATAATATA 


AAAATCATCC 


GTGTTAATTA 


TTAATGCACC 


540 


TTGATAGAAA 


TAGATGTTGT 


AGAAACCTGG 


GTTTTAACTT 


TATCATAACT 


CATTTTTATG 


600 


TTAGA 












605 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 
TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG 
CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT 
AACTTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 
GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 
GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACC 
TGATGATTGG AGATTCTCTA ACCGCCGACA TTCAAGGTGG CAATAATGCG GGGATTGACA 
CTATCTGGTA TAATCCTCAT CACCTCGAAA ATCACACACA AGCCCAGCCG ACTTACGAAG 
TCTATTCTTA CCAAGACTTG CTGGATTGTT TAGATAAAAA TATTCTTGAA AGATCACGTT 54 0 
TTAAGGAGAT AG 



(2) INFORMATION FOR SEQ ID NO: 167 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic a-:d 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 



C^TGTCTGT ATCTGCCTTT TTATGCTATA CTTAAGGTAT GCATAGAAAA ACAGTGATTG 
ATTNTAGGAC TTTAGGGGAG AGATATACCT TTACCCANCC TATTAAAGAG TTGAAAACGA 
GAAATGTAGC AGAAGTGGCA GATTTGCTGG CACNAGTGGA AAGCTACCAA GAGCAAGATT 
ATTATGTGGT GGGGTATGTC AGCTACGAGG CTGCACCTOC TTTTGAGGAG AAATTAGCAG 
TTCACAAGGT TCCTCTACTG GGCGAGTACT TGCTTTACTT TACTGTTCAC GATAGGGTGG 
AGACCTCCCC TATTCCTCTG ACTTATGAGG ATATTGATTT GCCCTCAAAT TGGCAGGGAA 
NTAACGTCTG CACAGAACTA TGANAAGGCC ATTGCCCAGA TACAC TGGCAGGGAA 



(2) INFORMATION FOR SEQ ID NO: 168: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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***** «~ °»«" "* 

TCCTTTTT CC ATC^ATC GTCCATCTGT GAAAACA™ wTG MTCM 
(3AGGCCGCCT ^ ATGGC TCAGA ^ ^cTTATCC 

ACGAGAACGA ^CrrCAATTC ^ GCCAACCCAC AACCTGAA ^ 

ctctttggaa atcggacggc ™£ taatatctac ataaatcccc 

iss ssu «- 

ggttttggtc cataggcatt 

LENGTH: 593 

(B) ^pe: nucleic acxd 

(C) stbaNDEDNESS: Single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: Genome DNA 

(xi) s ^ce D — o,. -o-.ie, 

C.CATCTGG — ^ = = 
— " G^TTAA AAAATGGCAA £££ ^„ T 

TTACAACAAA AGAATATAA qaaaMTA CATTAAATAA ^™ "JGGATTAGAT 

tagcaaatac caaaaagaac tagaaa^ TMAGGAG rr attgcaacag 

SSSt gaatttttgg 
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(i) SEQUENCE CHARACTERISTICS . 
(A ) LENGTH- 456 base pa.rs 
,B, TYPE: nucleic add 
(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



CCTGCTCCTT 


ATCTTTGCAG 


TAATTGGCGT 


TTTAGTGGCC 


TTGATAGCTC 


AATTTTACTC 


60 


AGCAAAGGCA 


GCAGTAGGTT 


TTGCTAAGGA 


ATTGACAAAC 


GATCTTTATC 


GTCATATTCT 


120 


TTCCTTGCCC 


AAGGACAGCA 


GAGACCGTCT 


GACAACTTCT AGTTTGGTTA 


CTCGCTTGAC 


180 


TTCGGATACC 


TACCAGATTC 


AGACTGGTAT 


CAATCAATTC 


CTGCGTCTCT 


TTTTACGAGC 


240 


GCCCATTATC 


GTTTTTGGTG 


CCATTTTTAT 


GGCTTATCGA 


ATCTCAGCTG 


AGTTGACTTT 


300 


CTGGTTCTTA 


GTCATGGTTG 


CCATTTTGAC 


CATTGTCATT 


GTTAGGGTTA 


TCTCGATTGG 


360 


TCAATCCTCT 


CTACAGTAGT 


CTCAGAAAGA 


AAACCGAACC 


ACTGGGTTCA 


NGAAACCCCC 


420 


CNCCATTGCA 


AGGATGCCGG 


GTTATTCCGT 


GCCTTT 






456 



(2) INFORMATION FOR SEQ ID NO: 171: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TO'-uLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:171: 



CTTGTACAGT 


TGAATCGTTC 


AGATATTGAT 


TTTGAAAACA 


GAGAGTGTGT 


TGTCTTTGGT 


60 


AAAGGAAAGA 


AGGAGAGACC 


AGTATATTTT 


GACGCTCGGA 


CGAAAATTCA 


TTTAAGAAAT 


120 


TATCTTAACG 


ACAGAAAAAG 


ATAGTCACCC 


TGCTCTTTTT 


GTAACGCTAG 


TTGGAAAAGC 


180 


CCAGAGACTT 


GGAATTGCTG 


GTGTAGAGAT 


TCGCTTAAGA 


AAGTTAGGAG 


ACAAACTCGG 


240 


CATACAAAAG 


GTTCACCCAC 


ATAAGTTCAG 


AAGAACTTTA 


GCGACTAAGG 


CAATTGATAA 


300 


AGGTATGCCT ATCGAACAAG 


TCCAAAAACT 


GCTAGGTCAT AGCAAGATTG ACACAACCCT 


360 


GGCCTATGCC 


ATGGTCAATC 


AAAATAATGT 


CAAGCATTCA 


CACCAAAAAT 


TCATCTCTTA 


420 


AAAGCAAATC 


CCGATATTCC 


GAAGAGGTGG 


TTGCATGAAG 


AAAGTGAAAT 


TAGGTGAAGT 


480 


GGCTACTTTT 


ATCAATGGCT 


ATGCTTTTAA 


ACCTCAAGAT 


TGGTCCTCTG 


AAGGAAAGAG 


540 


ATTATCCGAA 


TTACCGAATC 


TGACT 








565 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



CTTTATACTT 


TATCATTTCT 


AACTTAATTA 


TAGTCTTATT 


TTTAATAAAG 


TTCAATTATT 


60 


TATATGTAAA 


ATTCCTCATC 


AGTAAAAAAT 


AACCGATCTC 


ATTTCTGAGA 


ATCGGTTTTC 


120 


TAAATAAATC 


AAACCGATCA 


TTTACATAAC 


ATAAATTATG 


TAAATGATCA 


TACTACAACA 


180 


ACAAATCTTT 


GACTTTTCCA 


ATTTCACTTT 


TTGGAATAAC 


CAGGTGAATC 


ATATCACCCA 


240 


GATACATTCT 


GGTTGAGCCG 


TTAACTGTTT 


GGCTCTTGCC 


ATTATGGACT 


TGAGTTGTGA 


300 


TGAGGACGTT 


GTGTGGTAAG 


TTGAGTTCAT 


GAACTTGTTT 


CCCAGCAATT 


TTATCAGAAA 


360 


CTGGTATTTC 


GATAAGTGTA 


ACTTCTCCTT 


CGCTAGATAC 


TTCTTCTGGG 


AAGCATTTTT 


420 


TCCAGCATGG 


CTTCATAGAC 


TGGCGTTCCT 


TTGAGCNAAT 


CCCTGATNAT 


TTTAGAAACC 


480 


AGAGTGACAG 


ACCAGTGGCT 


TAAGGTGCGA 


ATATCTCCTA 


CCATCTCAGT 


TACGAGAT 


538 



(2) INFORMATION FOR SEQ ID NO:173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGTGCTAACC 


ATGAGAGCAT 


ATATTTCCTC 


CAACTATTTC 


TATTTCAAAA 


AATCCTTACT 


60 


ATATTGTATC 


ACAATCAGAC 


ACAAAAAGAA 


AAAGCAAATG 


ATAAACAAAT 


GCTTTTAANG 


120 


TTTTAAAAAA 


AGCTTCGAAA 


GGTTCTTCTT 


TATTTTTTAA 


NGGGAGAGAT 


AACGTTGATA 


180 


TCTAAATCGT 


GGTCAAAGCC 


GGCAATTTTC 


CTTTAGATGT 


GTATTGGTGA 


ATATCATAAT 


240 


CTAAATCAGT 


TTTAGGACTG 


CTCTCCAAAA 


ATCCTGAGTC 


TGAGCCGTAG 


GACGGAATCC 


300 


AAACAGAGGT 


AAACTTGCCT 


GTATCAATAC 


TGTGTTCTTC 


CATGAAGTAG 


ACACCAACGT 


360 


AGATGCCGAT 


GTTTTTAGCA 


CCTAGTGATG 


CTAGTTTTGC 


TCGAAAGTTT 


TCGACACCTT 


420 


CGTTCATATA 


GACATAGTTT 


TGTCTCCACG 


TCAGCCCATA 


GTAACTAGGC 


TGTTAGGAGA 


480 


GCACATTGTA 


GAAAACTTCG 


GCAGCCTTT 








509 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



CTGTTCTGTT TGCGACGCTA CTTGGTTGTA GCATGGGGGT GGTTTTAGAT GGTCAGTAAG 60 

TATCTTTTAT TAGCAGTTAT TTTCTCTGGC TTGGTGACTT GGATTCTCCG TATGATTTCC 120 

TTCATCTTAG TCAAGTATAA GGGCTTGCCT GCAATCGTTG AGCGTTTTTT GAAGTTCTTG 180 

CCCGTTTCCA TTATCTTTGC CTTGATTCTT TCAAGCGTAG TGACAGGTAA GGTTGGGAGC 240 

CTTCCTCAAA TTAAATGGCT AGACTTCTTA GCCGTCTTTC CAACAGCTTG GGTAGCCTTT 300 

CGCTACCGCA ATCTACTCGG AACAGTTCTC TTTGGAGTGG TCTTGATTGC CATCTTGCGT 360 

TTGGTCTCTT AAATTACCCA CCAAAAAAAC TTATCACAGA GATAGATATC ATATAATGGC 420 

GTAAATGCTC CTTTTCTGTT AAGATTATAA GGTATTCTAT TTTGGAGGAA ATGACATGAA 480 

AAAAATCGTT AAATACTCAT CTCTTGCTGC CCTAGGACTT GTTGCTGCAG TGTGCTTGCG 540 

GCTTGCTCAG GGGTGCTCAG AAAGAAGAGA AC 572 

(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


175: 






CTAGGTTATT 


TAGTAAAAGA 


CTAGAAGATT 


TTCTTTTCCC 


AATCGTCTTC 


TGTACGGCGA 


60 


GGATAGAAAA 


ACTCTGACTT 


GTCGGGAGCT 


TCCATCATTT 


CCATCAATGG 


TAGCATATCA 


120 


TAGGCCAGAT 


TTAAGTTTGG 


AATCTGGTCT 


TTTTGCACCC 


AAGAAACTTC 


TCCCTCTTCT 


180 


GAAGATTGAA 


GGGTACCAGA 


GAACTCAGTC 


GCCTTATAAC 


AAATGACAAT 


ATAGCGCCCA 


240 


CCTGTATCTA 


GTGGCCAATT 


TTTAATGCCG 


ACAAGTTGAG 


GATTTTGGAT AGTCAACCCT 


300 


GTTTCTTCGT 


AGATTTCACG 


AATGACAGAC 


TCCGCAAAAG 


CCTCATCATT 


TTCTACATGA 


360 


CCTCCAGGAA 


AGGCATAACC 


AGACCAGCGA 


TTGTTTTCAG 


GGGCGCGATA 


CTGCATCACC 


420 


ACGCGCTGAG 


TTTCGANGTC 


TTCCATCAGA 


CAGATATTTG 


TTTAAATTGT 


TTAATTGGGA 


480 


ACGGGACATA 


AATTTAC 










497 



(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CTGAAATTTT 


GGAAGAAGAA 


ATCGGTGTAG 


ATACAGTCCA 


AAAAATAGGA 


CGCATCTTGA 


60 


TTTTGTTTAA 


ACAATCTAGC 


AAGAAAGAAA 


ATCGCAAGAT 


TTCTAAGAAA 


GTCAAAGAAA 


120 


TCTAAGATCG 


AAACTCCAAA 


TAACTGTTTT 


TATAGAGAAA 


TAAAGGGGAC 


TAGCCTATGA 


180 


CAATCGAACT 


ATTGACTCCC 


TTTACCAAGG 


TAGAGTTGGA 


GCCAGAAATC 


AAGGAGAAAA 


240 


AACGCAAACA 


AGTTGGGATT 


TTAGGGGGGA 


ATTTTAACCC 


TGTTCACAAT 


GCCCATCTCA 


300 


TTGTTGCGGA 


TCAAGTACGG 


CAACAGTTGG 


GACTGGATCA 


AGTTCTGCTC 


ATGCCTGAAT 


360 


ACCAACCTCC 


TCACGTTTGA 


TAAAAAGGAA 


ACCATCCCTG 


AACACCATCG 


TCTCAAGATG 


420 


CTTGA 












425 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



CTGCTATCAA 


GACAGTATTA 


CCGCTGAAAG 


AAACAGCTTC 


ACTAGAAGGC 


GCAATTGAAA 


60 


AATAAGACGG 


GATCACGAAC 


TCCCGTTTTT 


CTATAAAAGA 


AAGGAAATGG 


GATGAAAAAA 


120 


TTAGTCTTTG 


TCTGTCTGGG 


AAATATTTGC 


CGTAGCCCTA 


TGGCCGAGTT 


TGTTATGAAA 


180 


TCAATGACAG 


ATAACTACGA 


AATCCAAAGT 


CGAGCAACTT 


CCTCTTGGGA 


ACATGGCAAT 


240 


CCGATTCATA AGGGGACTCA GGGAATTTTT 


CAAGAGTATG 


AGATTCCTTA 


TGACAAGAAC 


300 


AAGACATCGC 


TTCAGATTAG 


TAAGGAAGAT 


TTTGAAGCCT 


TTGATTATAT 


TATCGGAATG 


360 


GACGCTTCAA 


ATGTTCCGAC 


TTACGTCAGA 


TGTGTCCAGT 


AGACTGTCAA 


GATAAGAT 


418 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



ACTGTTTGAA 


AAATATGATT 


CGTGGTCGAG 


AAATGAATTG 


CATTTAAGCA 


ATGTAGTTCA 


60 


GTATATAGAT 


TTGGAAATTA 


ATGATTTAAC 


AAAATAAAGG AGAAAAAACA TGGTTAAATA 


120 


CGGTGTTGTT 


GGAGCAGGGT 


ATTTTGGAGC 


TGAATTGGCT 


CGCTATATGC 


AAAAGAATGA 


180 


TGGAGCAGAN 


ATTACTCTTC 


TCTATGATCC 


AGATAATGCA 


GAGGCGATTG 


CAGAAGAATT 


240 


GGGAGCAAAA 


GTTAGCAAGT 


TCCTTAGATG 


AGTTGGTTTC 


TAGCGATGAA 


GTTAGATTGT 


300 


GTTATCGTCG 


CAACTCCCAA 


ATAATCTTCA 


TAAGGAACCG 


GTTATTAAGG 


CTGCACAGCA 


360 


TGGTAAAAAT 


GTTTTCCTGT 


TGAAAAA 








387 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 



CTTATGAAAT 


CAATCATCAA 


AATAATATTG 


ATCAGGACTA 


TTTAGGTAAA 


TTATCTACAA 


60 


CGATTAAATT 


GGTAGCAGAA 


AAGGAAAATG 


CCGTTGAGAT 


CCTAGAACAC 


TTGAATGTTG 


120 


TCCCTGTGTT 


GACAGCCCAT 


CCAACACAAG 


TGCAACGCAA 


AAGTATGTTG 


GATTTAACAA 


180 


ATCATATTCA 


TAGTCTTTTG 


CGTAAATACC 


GTGATGTTAA 


GTTGGGGTTA 


ATCAATAAAG 


240 


ATAAATGGCA 


CAATGATTTG 


CGTCGTTACA 


TCGAAATTAT 


CATGCAGACA 


GACATGATTC 


300 


GTGAGAAAAA 


ATTAAAAGTG 


ACTAACGAAA 


TCACGAATGC 


TATGGAAATA 


TTACAACAGC 


360 


TCCTTTTTGA 


AAGCTGTCCC 


TCATTTGACG 


ACGGAGTATA 


AGCGCTTAGC 


GCAAGCGCAT 


420 


GGTCTGAATT 


TAAAACAGGC 


TAAACCAATC 


ACCATGGGTA 


TGTGGATAGG 


TGGTGACCGT 


480 


GAAGGAAATC 


CATTTGTTAC 


AGCAAAGAAC 


TTGAAGCAGT 


CTGCACTCAC 


TCAGTGTGAA 


540 


GTCNTCCTGA 


ACTTACTATG 


A 








561 



(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 



CGACCTAATC 


ATATTGATAC 


TGCATTTGCA 


TCAGCAGATT 


TGTTTGAGTA 


TAAATTACAA 


60 


TTAGCAGGAC 


AGACTTGGGG 


ATATTTAGAA 


TTTGAAACAA 


ATACAGAAAA 


ATATGGGAAA 


120 


GTATTGTTAA 


TTATAAAGGG 


TAAGAAGCGA 


CTTACGAACC 


AATTTCCTTT 


GGTACAAAAA 


180 


AATAAGAGTG 


GCTACTTATT 


TGAATATGCT 


CAGATGAATA 


CACTTTATCT 


TAATCAACAT 


240 


TCTTCCTACA 


AAAATGATGA 


AGATAGTCAT 


TCCTTTCCAA 


TTCAGATGGA 


GTTAGTTTCT 


300 


GATGAAATGA 


TTCCAAGAAA 


TTGAACAAGC 


TACTAAAAAT 


TCCGAATATC 


CGAAAAATTT 


360 


ATGATTTTAA 


CTTATGANGC 


GGACTCCCGA 


AAAACAATAT 


TATATCTGTT 


AGATGTTTGT 


420 


TATGCCCTGA 


TGCCCGAACT 


GGTCCAGTTA 


CACTTGATTC 


CAGATTGTCC 


TGAGTATATC 


480 


CATCCAGTTC 


CGTACC 










496 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 



CCACCCTCTA 


GCAAGTGGTG 


CTAATTCCTA 


TAGCCTAGAA 


GAAAACGAAA 


TCATCTACCA 


60 


AAAGTTAAAA 


AACTTGCTTA 


AAAACTGATA 


AAATACTTGC 


CAAACTTTTC 


AGAATCTGAT 


120 


AGACTAGTAT 


GGTAACAATC 


TATGGCTCGC 


AAAGAGACCA 


TGGCAGAAAG 


GAAATATTGC 


180 


AAAATGAAAA 


AAGATATCCA 


TCCAGAATAT 


CGCCCAGTTG 


TCTTCATGGA 


CACAACTACT 


240 


GGTTACCAAT 


TCCTTAGCGG 


TTCAACAAAA 


CGCTCTAACG 


AAACAGTTGA 


GTTCGAAGGC 


300 


GAAACTTACC 


CATTGATCCG 


TGTGGAAATT 


TCATCAGACT 


CACACCCATT 


CTACACTGGA 


360 


CGTCAAAAGT 


TCACTCAAGC 


AGATGGACGC 


GTGGATCGTT 


TCAACAAAAA 


ATACGGTCTC 


420 


AAATAATGAT 


AAGAGAACAG 


TTTCGGCTGT 


TCTTTTTTGT 


TTCTTGAAAT 


CAACTGCTGT 


480 


TTTCATGTTC 


CAGACTCATC 


TGTA 








504 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 



CTCAGGCGAT TACTAATTAC ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 60 

AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 120 

AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGCGCAGTAC 180 

CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 240 

GTTGATGATG GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT 300 

GACAGGGTGT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG 360 

ATGAAGCCAG GCTCACGGGG ATTATCTGAT TTTTATTGAC TCAGATGATT ATATCCATCC 420 

AGAAATGATT CAGAGCTTAT ATGAGCAATT AGTTCAAGAA GATGCCGATG 470 



(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 



CTTATACTTG CTTTTTTCCT TTTGAAAAAT GTTAATAAAT CGTTGGTAGT CTTCTCTAGG 60 

CATGGACAGA TCAATATCGT CGTCCCAAGG GATAAAGCCC TCATGTCGAA CCGCCCCAAT 120 

CAGAGTACCG TAGTTAATAA TATAGTTGAT ATTGTGCTTT TTACAGAGAG TATCAATATA 180 

ATCCAAAATT TCTAATTCAA TTTGTTTGGC ATCTTCAATG GTTAGTTGTT TCATTTTAAA 240 

CTCCTATGAT TTTTTGAATT TATTTTTTAA GGCTAGGACA TGGTTTAAAA ATTCATAGAA 300 

AATGCTATCT TTTGTGAAGA CAAGTAGACT AATATAAGAG ATAGCTGATA ACAAGACAAT 360 

CAAACCAGTA TTAATCAAAA ATGGCAAATT AATGACCATA TCCACAGGAT ACACGAAATT 420 

AATCAGGAAA TACATTGCTA CAAAGGAAAG TGAAAAGAGA GAGTATCGAA CAGTATAGCT 480 

AAAGATATGT CCCAAGTGGA TGAGTTGTTT CTATGGATGA AATGAT 52 6 

(2) INFORMATION FOR SEQ ID NO: 184: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 184: 



PCT/US97/05306 



ACCATATCAA 


TACTATCGAA 


AAGTACAAGG 


GATNCAGTCT 


CAAGGTCGCT 


GAGGAAGATT 


60 


TGAATGACCT 


AGACGATGGT 


GAATTTTACT 


ATCACGAGAT 


TATCGGTTTG 


GAAGTCTATG 


120 


AGGGTGATAG 


CTTGGTTGGA 


ACCATCAAGG 


AAAATCCTGC 


AACCAGGTGC 


TAATGATGTC 


180 


TGGGTGGTCA 


AACGAAAAGG 


CAAACGTGAT 


TTGCTTTTAC 


CTTATATCCC 


ACCAGTGGTT 


240 


CTCAATGTTG 


ATATTCCAAA 


TAAACGGGTC 


GATGTGGAAA 


TCTTAGAAGG 


GTTAGACGAT 


300 


GAAGATTGAT 


ATTTTAACCC 


TCTTTCCAGA 


GATGTTTTCT 


CCACTGGAGC 


ACTCAATCGT 


360 


TGGAAAGGCT 


CGAGAAAAAG 


GGCTCTTTGG 


ATATCCCAGT 


TNTCATAATT 


TTTCGAANAA 


420 


AATGCTGAAA 


AGG 










433 



(2) INFORMATION FOR SEQ ID NO: 185: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTI<> : SEQ ID NO:185: 



CTGCAACTTG 


AATTTCTCCT 


TCTTGTTTCA 


AAGCAAGATA 


AACAATTCGA 


GCCCCTCTTT 


60 


TTTCTAGCAA 


ATCCCCCATC 


TGGACAGATT 


GCATAAAGGA 


ACGAGAAGAA 


ACCTGATCAG 


120 


AATAAGTCTG 


AAACTCTTCT 


TTCGTGAGTG 


TTGTTAGTGC 


CATATACTTA 


CTTTCTATGT 


180 


TTTTTTCTTA 


ATGTTTTACG 


GAAATCAAGA 


GCAAGTCTTA 


ACAGAGGATA 


GAGAGGATGA 


240 


GTGGGCATTG 


TAAATTCACC 


CAAGTATTCT 


TCAATCGTTG 


GATTAAATTT 


TTCCTTAAAA 


300 


TGATAAAGTC 


CACCATTGAG 


AGAGTTTTCA 


ACACCACCTA 


AATTTTGCCA 


GACCATACCT 


360 


CGCTCAAAGG 


CATAGCGAGC 


CGTTTCATAC 


CATGTTAAAA 


TTGGTGCATT 


GTAACGTTTA 


420 


AAATCATCAT 


CCATACCAGC 


ATATATATTG 


ACAGAGGTAG 


TACCAAATTC 


CAAACTCAAA 


480 


GTACCGCTAA 


AGGAATCTCG 


CTTGACCTAC 


ATCTATATAT 


TCCTGCAAGA 


AGGTCAATTC 


540 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 : 
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(2) INFORMATION FOR SEQ ID NO: 187: 



60 



CTCAGGCTAA AAGAGTCCAC TGGACTCTTT TACTCCGTCC CATAACCAAT GACTTAATGG 
CGCGTTCAGC TAGATTATTG GAAAGGACCA GATGTCCGTC TTTCAAAATA GTCTTAAAGG 120 
TTTCTTCATA CTTGAGGCTG TATTCAATTG CCCTTCCTAG TTTTGAACCC GATAAAACTG 180 
ACTGACGCCG GCACCAAGCA AAGAAATCTT CCATTAGGGG TTGGAGCTCT TCTTGACGTT 240 
TCTGTAGTCG TTCATCAGCT GACAATGTCT CCCAGTCTCT TTCCAAGGCA AATAACTGAT 
CACAATAGGC TAATCCTTTA G 



300 
321 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



CTTGACCAAA CAAGGCAGTT GGAAATTGAT TTTGGAGGAT GAGAGCGCTG GAAGGGAACT 

CTTCCCAATC TTGACTCAAG GTCNATATAT CGCAACATTT GACCAACAAG CGCCANCTAT 1 

CGATGAAATC TTTAAACTAT AATCAGGAGT GGAAGTATGA GAAATATGTG GGTTGTAATC 1 

AAGGAAACCT ATCTTCNACA TGTCGAGTCA TGGAGTTTCT TCTTTATGGT GATTTCGCCG 2 

TTCCTCTTTT TAGGAATCTC TGTAGGAATT GGGCATCTCC AAGGTTCTTC TATGACTAAA 3 

AATAATAAAG TGGCANTACT GACAACAGTG CCATCTGTAN CATAAGGACT GAAGAATGTA 3 

AATGGTGTTA ACTTCGACTA TAAAGATGAA GCAAGTGCCA CCAGAAGCAA TTAAA 4 

(2) INFORMATION FOR SEQ ID NO: 188: 



fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 188: 

CTATNAAATA GAGTGGTTCT CTATCTGCCA TGACCACAAA TNAGGACAAT GATACTTCTG 60 

AACGTTCAGA CTGCTATCGT AAAAGGACAG CGGGTGAGAC GCCCATGAAT GATCTAACCA 120 

GTCATACCCA CGGAGGAAAT TATACAATAG CACGTTATCA GGAGGAAAAA TTTTGGAACA 180 

AACAGTTGTA AAATATAGTC TGCAGTTATC AATGCTTAAT TTTTTACAGG CGAAGATAAT 240 
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CGAACG 
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246 



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 i nea r 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:189: 



CTTGTCATCC 


TCACGAAAAT 


TGCGATTGTA 


AAGCACCAAG 


CCTTGACTCG 


TGATAGACTG 


60 


AATCATGCTT 


CTCTCATGTA 


CTCCTCAAGT 


CGTTTCATGG 


CTTCTTTGAT 


AGTCTCCATG 


120 


CTGGCTGCAT 


AAGATAGGCG 


GACGTAGCCT 


TCCCCGTAAC 


GTCCAAAGGC 


TGCACCAGGG 


180 


ATAAAGGCAA CGGCCTTCTT 


CTGAGCAAAA 


TCCTTCAGAA 


AAGCAAAGGA 


GTCTTGATTG 


240 


TAGCCCGCTG 


GAATTTTAGC 


AAAAATATAG 


AAGGCACCGT 


CTGGTTTGAT 


AATCTCAAAA 


300 


CCAAGAGCAG 


TCATTTTTTC 


GATGATATAG 


TCCCGACGTT 


GGATATATTC 


CTTCTTCATG 


360 


GGCTCCGCAT 


CGTTTTTACC 


AGCCGTCAAG 


GCTTCTACCG 


CAGCATGTTG 


CGCCATGGTA 


420 


TTTGCGGCAG 


TGACC^AGTT 


ACTGGTGACT 


CTTGATTAAC 


TGGGCTGTGA 


AGGTCNCANG 


480 


AGCAAAAATC 


AGCCCCAAAC 


GCCAACCTGT 


CATGGCATG 






519 



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 



CTTCAGCTAT 


TCGTGGGATA 


TTAGATGTAC 


TTATTTAGGG 


GTTGAAATCA 


TATGAATATT 


60 


ACCAATTTGT 


TTTCTATCAA 


GACAGGATGT 


GATGAAACTG 


ATAGGCAACT 


GCAAAAACTA 


120 


TTTTTTCAGT 


TGGATTTACA 


ATTGGGAGAA 


TTGACAGATC 


AACTAAGAAA 


ATTAGATTCT 


180 


AATTTTGTTC 


CTCGTAGTCA 


ATTTGTAGAC 


ACGTTGGATT 


TGAATGATGT 


AGAATATAAA 


240 


GAAATTTTAA 


ACTATTTTAT 


CTTCCATCGT 


AATGATAGTG 


AAGAAAGTTT 


GGTAGAATGG 


300 


TTATATGATT 


GGATTTCCAC 


AAATCGTTAT 


GAACTTCCCT 


AAAGAGTTTT 


CCGATTCCGT 


360 


ATGGCTCCAT 


AAATACCCAT 


GAAAGTGTTA 


CTGAAGTTTT 


CCGGAGAATG 


AATAACTAAA 


420 


AAACAGTCAT 


TAGTGACTGT 


TTTTTATAGA 


AAAGAGTTTT 


TATATGTT 




468 
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(2) INFORMATION FOR SEQ ID NO: 191: 



(i) SEQUENCE CHARACTERISTICS: 
£A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



(2) INFORMATION FOR SEQ ID NO: 192: 



60 



CTTGAATAGT CTCATCACGA AAATTGCCAT AACTGATAAA ACCAACCACC TTCACGCCAT 

CCATCGCAAT CAATGTATTT TCTGGATACT TTTGACTAAA GAATCGACAT CTTTCTAATG 120 

TCATTGTATC CTGAAATTCC GCAGGCAAAA GGTCATCATA AGACTCTCTC CACGTTTGCC 180 

AGTGAACGAG GGATTTGCCT TCTATCTCTT CAGGAGTTNC CATAGATTTG ATAATAACCG 240 

TCATTTATTT TCTCCCAGTC TNCTCTCAAA ATACCATATT TAATACTATC AAAATATTTA 300 

CCTTGATAAT AACGAACTTT NGGAATATGA GCTNCTTTTT TCATNCTTAA TTTTTCAGCA 3 60 

AGTTTCATCA TACCAAGATT TCCTGACCAA GTTGTCAAAA CCAGATGCTC CAACTCCAAG 420 

TAATCCTGAA ACGTCCTATC TATCCACTGC AACATAGCAA CTTTCCCAAT ACCAGTGTTC 480 

CAGAATTTTT TATCATAAAT ACCAATTCCC AATCCATCCA TCTTGTTTCC TTTACP a'ACC 540 
CAATA 



545 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CTTGATTGTG GAAGAAGTCC GTGATCGCTA TGTGGGCAAA GTCGATGCCG TTTTTCATAA 

CGGCGATTCT GAACTACGTC CGGATTCTCC ACTTTGGGAG GGCATCCGCG TTGTTAAAGG 1 

GAACATGGAC TTCTACGCCG GCTACCCAGA ACGTCTGGTG ACTGAGCTTG GTTCGACCAA 1 

GATTATCCAA ACTCATGGTC ACTTGTTTGA CATCAATTTC AACTTTCAAA AGTTGGACTA 2 

CTGGGCTCAG GAGGAAGAGG CCGCTATCTG CCTCTATGGT CACTTGCATG TGCCAAGTGC 3, 

TTGGTTGGAA GGCAAGATCC TCTTTCTAAA TCCAGGCTCT ATCAGTCAAC CACGAGGTAC 3i 

CATCAGAGAA TGTCTCTATG CTCGTGTGGA GATTGATGAT AGTTACTTCA AAGTGGACTT 4' 

TTTGACACGA GATCACGAAG TGTATCCAGG TTGTCCAAGG AGTTTAGCCG ATGATGCCAA 4 | 
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(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CGTAACCATT 


AACTATGAGG 


TGGTTTGCCT 


CCTCAGCGAC 


CGTATTCCGA 


GAGAATATTA 


60 


TTAGAAAAGA 


AAGGAGTGGA 


GCATGAATCT 


ACATCAACCC 


TTGCATGTCT 


TGCCTGGTGT 


120 


GGGACCAAAG 


TCAGCAGAAA 


AATACGCCAA 


ACTAGGAATT 


GAAAACTTGC 


AAGATCTCTT 


180 


GCTCTACTTT 


CCTTTCCGTT 


ATGAAGACTT 


CAAAACCAAG 


CAGGTGCTGG 


AACTAGAAGA 


240 


CGGTGAGAAG 


GCAGTTCTTT 


CTGGTCAGGT 


AGTGACTCCT 


GCTAGTGTCC 


AGTATTATGG 


300 


TTTCCAAGCC 


GCCAATCCGC 


CTGCGTTTTT 


AGTCCTCCAA 


GCCAGGGAAG 


AAGGTTCGTT 


360 


TTTTTTTGGC 


GGGTGAAATT 


TCCTTTTTAA 


CCCAGCCCCC 


TATCCTTGGG 


CTTGAATAAA 


420 


AAA" v AG AAA 


TTTTGGGGAA 


CCAACCCCTT 


TGGCTTGT 
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(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 



CCTTACTAGT 


TTACTCAATC 


TTTACATCGT 


TCTTTCGGCA 


GTTCAAACTC 


AGAATCCACA 


60 


GCACAGGTTA 


TGCAATTACT 


GTCTGAAAAT 


ATGTTAAAAA 


CCATTCAGTC 


ACTTTCGGTC 


120 


TGGCAGATTT 


ATTTGCTTGG 


TTTTGAGCGA 


ATCTTGGCGC 


TTGGTTTCCA 


ATTACTTTTG 


180 


ACAGTTTGGG 


TTTACCAAGC 


TGTTCGCCAG 


AAGAAATGGA 


TTTATCTCCT 


AGCAGCCTAT 


240 


GGCTTGCATG 


CCTTCTTTGA 


TCTGGCACCA 


TCTCTTTTCC 


AAGTAGGCTG 


GTTGACAAAT 


300 


CCAGTCTTGG 


TTGAAGTGAT 


TCTAGCACTG 


GAGCTCGTTC 


TGGTCGCCTA 


TGGAACCAAG 


360 


GAAATCTTTT 


GTAAAAAATC 


ATAAAAAGGG 


GGGAACCTCT 


TTTTCTTATG 


CAAAATCCAA 


420 


ACAAGGTATT 


TTTATGGTCG 


TCAAATGTCT 


CTAAAAATGG 


TATAATGGAA 


TGAATTTTGT 


480 


AAAAGGAAGA 


ATGACATGTC 


TGTAAGAGAA 


AAAATGCTTG 


AAATCTTAGA 


AGGAATTGAT 


540 
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(2) INFORMATION FOR SEQ ID NO: 195: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:195: 



CTTTGACAAA GAGTATGACA CCTGACCGTG AAGTCATTAC CTTTATTCCT GAAAAATTTA 60 

TTGTGGATGG TTTCCAAGGG ATTCGTGACC CACGTGGCAT GATGGGGGTT CGCCTTGAAA 120 

TGCGTGGTTT GCTTTATACA GGACCTCGTA CTATCTTGCA CAATTTGCGT AAGACGGTTG 180 

AGCGTGCAGG TGTTCAGGTT GAAAATGTTA TCATTTCACC ACTAGCAATG GTTCAGTCTG 240 

TTTTGAACGA AGGGGAACGT GAATTTGGTG CTACAGTGAT TGATATGGGG GCAGGTCAAA 300 

CGACTGTCGC TACAATCCGT AATCAAGAAC TCCAGTTCAC ACATATTCTC CAAGAAGTGG 360 

AGATTATGTA ACTAAAGATA TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TAGCGGAAGG 420 

CTTGAAACTG AATTACGGGG AACCTATCCG CCTCTTGCAA GCAAA/ AACT CCATTAAAGT 480 

TATTGGAGAA TTAAACCAGT CAAAGTGACG GAACCTACTT GTC S2 3 

(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 196: 

CTGTGATTTC AGAGAAGAAA TCAAGTGCTG TAACAGAAGT AAGATGTAAT TGTATGTAAA 60 

GGAGACGTCA TGTTAAATAG TATTGTAACC ATTATTTGTA TTGCCCTTAT CGCGTTTATC 120 

TTGTTTTGGT TTTTCAAAAA GCCTGAAAAA TCTGGACAAA AAGCCCAGCA AAAAAACGGA 180 

TACCAAGAGA TTCGAGTGGA AGTCATGGGA GGCTATACTC CTGAGTTGAT TGTCCTCAAG 240 

AAATCAGTGC CAGCCCGCAT TGTCTTTGAC CGCAAGGATC CTTCACCATG TCTGGATCAA 300 

ATTGTTTTTC CAGATTTTGG TGTACATGCG AACCTGCCAA TGGGGGAAGA GTATGTAGTG 360 

GAAATCACGC CTGAACAGGC TGGAGAGTTT GGCTTTGCTT GTGGTATGAA CATGATGCAC 420 

GGCAAGATGA TTGTAGAGTA GGTGGAGACT ATGACAGAAA TTGTGAAAGC AAGCTTAGAA 480 
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AATGGCATTC AAAAAATCCG TATCCGAGCT GAAAAAGGCT ATCATCCACC CATATCCA 536 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



CCTCAAGAAG 


TCCGCGATAC 


TTACGAAAAT 


ATTTTGTAGA 


GAATCGAACC 


GCAAGGTTCG 


60 


GTTTTCTTTC 


TCTTTTTGTC 


TATAATTTGG 


TATAATAAAC 


AGTATGAAAA 


TCGTATCAGG 


120 


AATCTATGGG 


GGACGTCCCC 


TCAAGACACT 


AGAAGGCAAG 


ACAACAAGAC 


CTACTTCGGA 


180 


TAAGGTTAGG 


GGAGCCATTT 


TTAACATGAT 


TGGTCCCTAC 


TTTGAAGTGG 


GACNAGTCTT 


240 


GGACCTTTAT 


GCAGGTAGTG 


GTGGTTTATC 


TATCGAAGCA 


GTATCGCGTG 


GCATGTCCAG 


300 


TGCTGTTTTG 


GTGGAGCGAG 


ACCGTAAGGC 


TCAGACCATC 


GTGGCTGAAA 


ATATCCAGAT 


360 


GACCAAGGAA 


GTTGGAAAAT 


TTCAACTCCT 


CCAAAATGGA 


TGCAGAAAGG 


GCATTGGAAC 


420 


AGGTTNTCTG 


GGGAATTTGA 


CCTCGTTTTC 


TTAGACCCTC 


CCTATGCCAA 


GGAACAAATC 


480 


GTAGCAGATA 


TTGAAAAAAT 


GGCTGAGAGA 


GAGCT 
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(2) INFORMATION FOR SEQ ID NO: 198: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



CTGGANACAA 


ACTTTACAGA 


TCAAGTCGAT 


ACGATGATTT 


ATGTTGATAA 


AGAAGAAAAA 


60 


GAAACTATTA 


AAGCTGCACT 


TGTGGAGTTT 


TTTAATGGAA 


AAGTCACTTT 


AACTGACCAA 


120 


GGTTTACGAG 


AGGTTGAAGT 


TCCTGTAAAC 


TTAGTGTAAA 


CAATGAATAA 


TACAGCGTTT 


180 


CGTTGACATT 


CTCACAACTA 


CTTTAGCGAG 


CAAAATAAAA 


AGATGCGTAC 


CAAAATATAC 


240 


TAGAAAATGA AGCAATTCAA ACGAAACCTG 


ATATCGTTTT 


CCTTCACACC 


TATTTACTAG 


300 


AATTAACTGA 


ACGCAATCAC 


TTGAAAATTA 


ATGACTTTGG 


AT 
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(2) INFORMATION FOR SEQ ID NO: 199: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 199: 



CTATTAGCGA CTTTCTCTGA AATATGGTAT GATAAAGGAT ATACAAGGAG ATAAAATGAA 
TAATAATTTA CTGGTATTAC AATCCAGACT TTGGTCTGGT TGATGGTGCG GTATCGGCTA 
TGATTGGAGT GGCTTTAGAA GAGTCTCCAA CCTTAAAAAT CCATCACTTG ACGCACGATA 
TCACGCCTTA TAATATTTTT GAGGGGAGCT ATCGTCTCTT TCAGACGGTG GATTACTGGC 
CTGAGGGAAC GACGTTTGTA TCGGTTGTCG ATCCAGGTGT CGGTTCGAAA CGTAAGAGTG 
TAGTTGCCAA GACTGCTCAA AAATCAATAC ATTGTCACGC CAGATAATGG GACGCTTTCC 
TTTATCAAGA AACACGTTGG CNTTGTANCC ATTCGTGAGA TTTCTGAGGT GGCCAACT 

(2) INFORMATION FOR SEQ ID NO:200: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



CTCTCGCTTT TCTCATAGTG GGAGGTAAGG ATGGAATTAC GCAGACCAAG ATTAGCGGAT 60 

AAGAAAGCTG TTTTAGATAT GATGACAGAG TTTGAAAAGA GCCAATCAGC CCATGATGGA 120 

GGATTTTGGG ATACAGAGAA TTTTGTGTAT GAAGAGTGGT TGGAAAGCAA TCAGGAACAG 180 

GAAATGGGGA TTAATCTGCC TGAAGGATGG GTTCCTGCAA TTCAGTTAGT GGCTTTTTCT 240 

GAGAAAGGTC AAGCAGTTGG ATTTCTTAAT CTCCGGTTGC GCCTCAGTAA CTTTCTACTA 300 

GAAGAAGGTG GGCACATTGG CTACTCCATC CGTCCATCTG AAAGAGGCAA GGGTTATGCA 360 

AAAGAACTCT CCGTCAGGGC TTGCAAGTTG CTAAGGAAAA GAACATCAAG AAAGCTCTGG 420 
TGACCTGTAG CGTGAATAAT CCTGCTAGCA GAGCATCATT CTA 

4 d J 

(2) INFORMATION FOR SEQ ID NO: 2 01: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 475 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



CTCTTACGTG 


ACCAGAACAT 


GCTACTTCCC 


ATCCGAGAAG 


CCATTCTATC 


TGGCTTACCA 


60 


GTGTTTGGGA 


CCTGTGCGGG 


CTTAATTTTG 


CTGGCTAAGG 


AAATCACTTC 


TCACAAAGAG 


120 


AGTTATCTAG 


GAACTATGGA 


TATGGTGGTC 


GAGCGTAATG 


CTTATGGGCG 


CCAATTANGA 


180 


AGTTCTNCAC 


GGAAGCNNAA 


TGTAAGGGAG 


TTGGCNAGAT 


TCCNATGACC 


TTTATCCGTG 


240 


GTCCGATTAT 


CAGTACTGTT 


GGTGAGGGTG 


TANAAATTCT 


AGCAACAGTG 


AACNATCAAT 


300 


TTGTTGCNNC 


CCAAGANNAA 


AATATGTTGG 


TAAGTCCTTT 


TCATCCAGAA 


TTGACTGATN 


360 


ATNTGCGCTT 


GCNCCAGTAC 


TTTATCAGTA 


TGTGTAAAGA 


AAAAAGTNGA 


GATTGAATTT 


420 


CTCCAACTTT 


NCCACNTGTA 


ATNNNCAATN 


NCNATGTATT 


GGAGTACGGA 


CGCAG 


475 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 : 



CCTTCAAAAA 


TTATTTGATG 


CAACCTTAGA 


CAATCCAGAA 


CTCACCGCAA 


CATTAGTGCC 


60 


TTTAGGAGAT 


GGTATTCTCA 


TGCTTCGTAA 


AAATGTAGCA 


GATGTTCAAC 


TGTCTGAAAG 


120 


CGAATGATTT 


TCAGAAAAAT 


TTAAGAAAAA 


ATAGTAAAAT 


AGATAGAGTA 


ACACTTATCT 


180 


CAAAGGAGTA 


GACATGAAGA 


AAAAATTATT 


GGCAGGTGCC 


ATCACACTAT 


TATCAGTAGC 


240 


AACTTTAGCA 


GCTTGTTCGA 


AAGGGTCAGA 


AGGAGCAGAC 


CTTATCAGCA 


TGAAAGGGGA 


300 


TGTCATCACA 


GAACATCAAT 


TTTATGAGCA 


AGTGAAAAAC 


AACCCTTCAG 


CCCAGCNAGT 


360 


CTTGTTAAAT 


ATGACCATCC 


AAAAAGTTTT 


TGAAAAAACA 


ATATGGCTCA 


GAGCTTGATG 


420 


ATAAAGAGGT 


TGATGATACT 


ATTGCCGAAG 


A 






451 



(2) INFORMATION FOR SEQ ID NO:203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



(2) INFORMATION FOR SEQ ID NO: 2 04: 



60 



CTATGGATAA GGCACTTGCT GACCTCAAAA CATCAGGGCA CTTGCCTATT CCGCGACACC 

TACGTGATGG GCACTACANT GGAAGCAAGG AACTGGGGAA TGCCCAAGAC TATCTCTATC 120 

CACACAACTA TCCTGNAAAT TGGGTCAAGC AAGACTATCT GCCACAAAAA ATTCGTAATC 1B0 

ATCACTATTT CCAAGCAGAA TATACTGGTA AATATGAACG GGCTTTGGCT CAAAGAAAGG 240 

AAGCTATCGA CCATTTGCGA AAAATCTGAA ATCCTTTTCA AAAAATTGCA CTTTCCTCTT 300 

GATTTTTTTT GAAAAAGTGG TATCATATAA ATATAGAAAC GCTGTGGTGT ACGACTTCAC 3 60 

ACTTAAGTGT TGACCGACTA TTTTTTGTAT TATTANGGAA ACAAAAGTCT TCTGTCAGCA 420 
TGTAGGCCGT CTCACACGGA AACAGCTTCA GTT 



453 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: 1? iear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 04: 



CCGTAGTACA GGCACAAAAA GATTTGGAAA ATAGAAAAAG AAAAGCCAAG AAAAAGGCTC 60 

AGAAAACGAA ATAAATAAGG AGGAATCTGG TAATGGTAGT ATTTACAGGT TCAACTGTTG 120 

AAGAAGCAAT CCAGAAAGGA TTGAAAGAAT TAGATATTCC AAGAATGAAG GCTCATATCA 180 

AAGTCATTTC TAGGGAGAAA AAAGGCTTTC TTGGTCTATT TGGTAAAAAA CCAGCCCAAG 240 

TGGATATTGA AGCGATTAGT GAAACGACTG TTGTCAAAGC AAATCAACAG GTAGTAAAAG 300 

GCGTTCCGAA AAAAATCAAT GATTTGAACG AGCCTGTGAA GACGGTTAGT GTAAGAAACC 360 

GTTGACCTTG GTCATGTGGT TGATGCTATT AAAAAAATAG AGGAAGAAGG TCAAGGTATT 420 
TCTGATGAAG TC 

432 

(2} INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



AAAATCAAGA 


AATTCTTTGA 


AGGAACGACA 


GATACAGCTG 


AGAACTATAT 


CAAGTCGGCC 


60 


CTTAAAATGT 


TGGTCAAATA 


GGAGCAGAGA 


ATGACAAAAC 


GTTGTTCGTG 


GGTCAAGATG 


120 


ACCAACCCGC 


TCTACATCGC 


CTATCATGAT 


GAGGAGTGGG 


GTCAGCCCCT 


CCATGATGAC 


180 


CAAGCATTGT 


TTGAGTTGTT 


GTGTATGGAA 


ACCTATCAGG 


CAGGCCTGTC 


TTGGGAAACG 


240 


GTACTCAACA 


AACGCCAAGC 


TTTCCGAGAA 


GCCTTTCCAT 


AGCTATCAAA 


TTCACTCCAG 


300 


TCGCAGAGAT 


GACTGACACT 


GAATTGGAAG 


CCATGCCTGG 


AGAATCCCAG 


CTATCATTCC 


360 


GAAATAGAGC 


CCAAGATTTT 


TGCTACACGC 


CGCCTAACGC 


CCCAACCCTT 


TCCTACAGTT 


420 


ACAGGCAG 












428 



(2) INFORMATION FOR SEQ ID NO: 2 06: 

(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 518 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



CTATTAGATG 


GTATTGACTG 


CCCAGACACT 


TACGGATCTA 


GCTGAGACAG 


GGAATTGTCC 


60 


ATAACCTTCC 


TCATCAATTG 


TAACTTGACC 


TTGGTGGTTA 


CCAAGTAAAT 


CTACAAAGGT 


120 


TTGATTAGTC 


CATTCTTGAC 


CGACAAACAT 


TGACTTGCTG 


TTTTCTTGGT 


CATTTGAAAT 


180 


CAAGACTGCG 


ATTGGGGATT 


GATTTTCAGC 


ACCTGAACGT 


ACCCAACCGA 


TACAGTTAGC 


240 


ATGGTCAAAG 


TAGTCATTTT 


GTTCTCCATA 


GGCCAAATCT 


TTTCGGATGG 


CTAGGAGGCG 


300 


GTCAAGGATT 


TCTTTGAAAT 


CTTGTTGAGC 


ATACTGGCCT 


GAAATCCCAT 


AGTAGTCTCC 


360 


GTAAAAGACA 


CATGGAAGGC 


CGTCTTGGCG 


TAACAAAATG 


AGGGCATAGG 


CTGCTGGCTT 


420 


GAACCATTCT 


TCAACGGTAG 


ACTCAAGGGC 


TTGTCCTCGT 


TGGGTATCGT 


GGTTGTCGAC 


480 


AAAATCACAG 


ACTTGTCAGG 


CTTGAGTTCA 


ACCAGGCT 






518 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



{2) INFORMATION FOR SEQ ID NO: 208: 



CTCTGTCCTT TTACCTGAGA GTTTGAGCAG TTGCCTGCCT TGCCCCTTCG GTCCCTTTAC 60 

GGTCTCTCCA GAGTTCCGTC CATTTACAGT CATGGAAAAT CAAACGATTC CCCACTTCTA 120 

TTAAACTTCA TTCGGTGTTG GTATTTAATT GATTCTAATT TTACAAAAAA TGTTGGCTTT 180 

TGTCAATGTG TTTATTAGTA AAAATTAGTT CAACAGTTTT TACTTTATAA AGTCCAGAAT 240 

ACTGCTATCC TTTAAAAGTG ACAATAGTCG CACCACTGCC TCCAGCATTT TGTGGGGCAT 300 

AGCCGAAACT CTTGACATGT TTGTTTCTTT GCAAGTATTT GGTAACTCCT TCACGGATGA 360 

CTCCTGTTCC GATACCATGG ATGATATCAA CTTGAGCCAT ATTGTTAAGC AAGGCTTGGT 420 
CGAATGAAGG TATCTAGCTC ATTCATGGCT TCTTCATAGC GCTTGCCTCG AAGATTCAGT 
CTAGCTTGAG TCCTCGCCCA GAAGTTCG 



480 

508 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MjLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 



CTCTTGGGCT CTCTTCTCAG CAACTTCCCT GCTTTCACCT TTGGTCTTGA TAGGTGANAA 60 

CGTAATATNT CCAATCNNNG TCATATTAGG AAANAGTTAA ATTGTTGGAA AACCATGCCC 120 

ATCTTCTCAC GCATGGGAAA CAGGTCATTC TTCTTGTCCG TAATATCGAC TCCCTCAAAG 180 

ATAACCTTCC CCTTGGTTGC TTCCTCCAAC AAATTCATAT AGCGAAGCAA GGTAGATTTC 240 

CCACTCCCTG AAGGACCGAT GATAACGACA ACTTCTCCTC TTTTAATCTC GAGGTTGATG 300 

CCCTTCAATA CTTCATTCTT TCCAAAAGAT TTATGTAATT TTCAATTTTT ATCAAGGTTT 360 
CTGTCATTAT TTCTTATC 

378 

(2) INFORMATION FOR SEQ ID NO: 209: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
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CTGGCCAGCC 


AGATAAACCT 


CGATACGGCT 


TTCTGCTCGT 


TTGATGCGTT 


GCAAGAGTCC 


60 


GCCCATACGG 


ACGTCAACTG 


TATCCAAACC 


AAAGACCTTG 


TTTTATTTCA 


GCCAGTGGTG 


120 


ACTAAAGAGG 


GCATGGAAGT 


CTTCAATTTG 


GCTTCTAAGT 


TCTGGTAATT 


CTTGTCTGGC 


180 


GATTTGTTGT 


AAACTTTCTT 


TATCATCCGC 


TTGGTAGGCA 


TGACGAATGC 


GTCGTCCCAC 


240 


ATCTACTTTG 


CTACTTAAAA 


TAGCATTCAA 


CTGGGCCTGA 


GTTTCAAAGA 


GATAAGCATA 


300 


GTTTCCAGCT 


TTTTCTTTAA 


TGTCAGCAAT 


AGTTCCCGCC 


GCCTGAGCGA 


AGTGTGGCTT 


360 


GTCCTGTTCA 


GGTGTCATGT 


GTCGGTCAAG 


TATCGGACAG 


AGAACATCCT 


GATAAAAGAC 


420 


ATAGCGGTTG 


GGATTGATGC 


CACTGAGATT 


ACCTGGTAGG 


TCTGGTAAGA 


GGTTGGCAAG 


480 


ATCAATCTGC 


ATAAAATCCT 


CAACCGTTAG 


ACCAGTATTG 


GTCTTGAAAT 


GCGCAGACAA 


540 


ACTATCTAGG 


TCATTGCGAT 


AGATCAAGTT 


ATCGATACCG 


TCGACCTCGA 


GGGGGGCCGT 


600 


ACCCAGATTT 


GCAAGCTTGG 


TAGGATAAAG 


AACTGGGCAG 


TTTCACCACC 


ATTGTCTCCC 


660 


CAACCCGTTA 


CGATGACTTC 


TTTAATCTGA 


TTGGCACGGC 


AGGCTTTATT 


AGCCTCGATA 


720 


GCCACTAGAC 


GGCTAAAATG 


G 








741 



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



AAACTCGATT 


TGAGAGAAAG 


TCCAATAAGT 


CTTTCCATAA 


TAAAACGCAT 


AGTAGCAAGA 


60 


GTTTCTACAC 


CCACTATTAT 


GCGTTTTTAT 


CATTTTGAAA 


TCTACTGCTA 


ATCAATGAAG 


120 


AAGCAAAGAT 


CAAACTAGGC 


AGCTAGTTAC 


AAGTTACGTA 


AGCGCGTCAT 


AACAAGGTAT 


180 


CTATCCTAAT 


TCCCCAACTT 


CCAGTATACT 


AGATAAAAAA 


TAATTATCGG 


AGGAAAGGTA 


240 


TTGTACTATG 


ATAATTCAAC 


TAAGTGATTT 


AGGTCAAGTT 


CACCTTGTTT 


GTGGCAAGAC 


300 


AGATATGAGG 


CAGGGAATAG 


ACTCATTAGC 


CTATGTAGTT 


AAAACCCACT 


TTGAATTGGA 


360 


TCCTTTCTCC 


GGTCAAGCCT 


TTCTCTTTTG 


TGGTGGACGT 


AAAGACCGCT 


TTAAAGCCCT 


420 


TTACTGGGAT 


GGTCAAGGAT 


TTTGGCTACT 


ATATAAACGC 


TTTGAGAACG 


GAAAACTGAC 


480 


TTGACCAAGT 


ACAGAAAAGG 


ATGTCAAAGC 


TCTCACACCA 


GAACAAGTAG 


ACTGGCTTAT 


S40 


GAAGGGCTTT 


TCTATCACTC 


CCCAAATATA 


GTGGATTTAA 


ACTAGAATAG 


TACACCCCTA 


600 


CTTCTAAAAC 


ATTGTTAGAA 


ATCGATTTGA 


CTGTCCTGAT 


CAATTTGTCC 


TGTTCTTATT 


660 


TCATTTTACT 


ATAGAATCCA 


TCTGAAAGCC 


GTTTATGATT 


TCTATTGAAA 


TGAAGACCGT 


720 


CCATTTTAGT 


AGACTAAAGG 


ATTACTCAAA 


ACTTCGAGAA 


GGAAGACAGA 


TGAACTTGTT 


780 


TCTTATAAAA 


ATGTTGTCTG 


AAACTATTCG 


CTTTTTTCCA 


GAAATTTTAG 


AGGAAGAAAA 


840 


CTTTATGAGG 


AAAAAGGAGT 


TACTCAAATA 


ACAATTCAGG 


ATTAAAAATA 


GACAGTTGAG 


900 


GAGCGGAAGG 


TATAAATTAA 


GTTTGCTACT 


GTATAATGGA 


TTTAAATCAC 


TCAGCAGACA 


960 


GAACGAATAC 


TCTTCGAAAA 


TCTCTTCAAA 


CCACGTCAGC 


TTCGCCTTGC 


CGTAGATCCA 


1020 
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TAGGGGACTG 
CGATCAATTT 
ATTTGATTTT 
TAATTTTTGG 
TTTAAAAAAT 
TTTGTCAATA 
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GACTTCATCA GTTCTATCAA CAACCTCAAA ACAGTGTTTC GAGCTGACTT 1080 
TATCTGCACC TCAAAGCTGT ACTTTGAGCA AGCCTGACGA CTAGCTTCCT 
CATTGAATAT CAGAAACTCA TTCTCCATCA AATAATTCGA CTGCGTCTAA 
TCTGGCACGG TGTCAGAAAT AAAGGTTGTG TATTTGGAGA GGGGATTAAT 
CCAGTCTTGT AAAATTTAGA ACTATCAATC AGTAAGATGG TTTCATGGGC 
ATATTCTTTT TTGAAATAG 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1359 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



1140 
1200 
1260 
1320 
1349 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 



CTTTTTTATA 
TCCGATTAAC 
GATTCATGAC 
CTTTCCACAC 
CAATATTAGT 
GTAAAAAATA 
CAGTATTACA 
CTCCATAACA 
TTGGAGACCA 
TCAAGCGAAC 
AGATTTTATT 
GTGGCATAAT 
AGGAACGACC 
CTGCCTCTGT 
ATGCATGGTT 
TAAAAGTTCT 
CTAGAATAGT 
GATTTGCCCT 
AGCTGTACGA 
CATATCACTA 
TAGGTGCTAT 
ACTTAGAGGC 
TTGGTTATAC 



AGCAATTCAA 
GATGGACTTT 
ATCTTCCAAA 
TTGTTCAATG 
CGGAATCTTT 
ATCATCTGGA 
TCTACCAGCT 
ATAGTGAAAT 
ACAAGATCCC 
CTTATGAAAA 
TTTAGACGCG 
ATTTCGCCAG 
TAACTCCTTT 
AGCCTTCAAA 
ACACCACCAC 
ATAATCTCTT 
ACACCTCTAC 
ATTCTTGTTT 
GAGTCTTTTA 
TTATATAATG 
GGATACACCG 
CCACATCAAG 
GACTTGGGGC 



TTTTATACTC 
ATCACCTCCT 
ATTCGAAAGG 
GGGTTCATCT 
AAGACACTTG 
TAAGCTTGTG 
ATTAAGAAAA 
TCTCGTATAT 
AGTTTACTGA 
CGTCTACTGG 
ACAATGGTTT 
CCATGGCGCT 
TTATAAGCCT 
TGGCGGGCAA 
TTTCCTGATA 
TATAAGATTT 
TTCTAAAACA 
CGTTTTACTA 
AAAGTGTTTT 
CTTTTTGATC 
GATGAGAATG 
GCCATGCGAG 
TGTATCGATC 



ATCTGCTTTC 
TCTCCAATCC 
CTTTATTCTT 
CTGGTGTGTA 
ATTTATGCCA 
AAAGCTCTTC 
AATGATTCCG 
AGTGACTATG 
TTCTACCGAA 
TTTAAATCGC 
GAGCGTCTGC 
TCAACAGTTG 
GAAATAAGGC 
GAAATACTTT 
GAGTTGTTCA 
GCCCATCAGA 
TTGTTAGAAA 
TAGAACGATT 
GATGGCTTGG 
TTTAGTTTAG 
GTTATGTAGC 
ATGCCATTTA 
CAGTTTCAG 



AAAAAGCATT 
TTGTATGACA 
AAATCCACGT 
TGGAGGAATA 
TATAGCATTG 
TAAAAAGGCG 
CCTGTTATGG 
GACATGTGGA 
ACCAGCCTCA 
TTTCTTGTCT 
TTTCTTAGGA 
ATAGAAGGCA 
ATCAATTGTA 
CTCTTCCTCA 
CATCTTATTT 
CGAAATATAA 
TCGATTTGAC 
TGAAGGCGTT 
ATTTCTTCTT 
TATTATCGTA 
AGATGACTAT 
CCNAGACGGG 



CTAGTCCATC 
TCTTGAAGTT 
TTACGAATCT 
AATGCAAAAC 
TCCATAACGA 
TTCATCCACT 
GCATCAACAG 
CCTACTCCTA 
TCTTGGTACA 
TCTTGAATTG 
TGTTCTGGAC 
TCACGTGTGT 
ACAAATTCTC 
ACTGTCATAT 
CAAACTACTA 
TAAATTGAAA 
TGTCCTGATC 
TATGATATTG 
TAGTTGATTT 
GAAAATGGAC 
CGGATTACTT 
GTTGACTTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1359 



(2) INFORMATION FOR SEQ ID NO: 212: 
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PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



CTAGTATTTC 


CGTTGTGNTT 


AGTTTGTCTG 


CCTATGGAGT 


TATTGCCTAT 


GTGCAAGGTC 


60 


AGNTGGATAT 


TCTTCTAGTG 


ATTCTGGCCA 


TGATTGGTGG 


TTTGCTCAGT 


TTCTTCATCT 


120 


TTAACCATAA 


GCCTGCTAAG 


ATCTTTATGG 


GTGATGTGGG 


AAGTTTGGCT 


TTAGGTGGAA 


180 


TGCTGGCAGC 


TATCTCTATG 


GCTCTCCACC 


AAGAATGGAC 


TCTCTTGATT 


ATCGGAATTG 


240 


TGTATGTTTT 


TGAAACAACT 


TCTGTTATGA 


TGCAAGTCAG 


TTATTTCAAA 


CTGACAGGTG 


300 


GTAAACGTAT 


TTTCCGTATG 


ACGCCTGTAC 


ATCACCATTT 


TGAGCTTGGG 


GGATTGTCTG 


360 


GTAAAGGAAA 


TCCTTGGAGC 


GAGTGGAAGG 


TTGACTTCTT 


CTTTTGGGGA 


GTTGGGCTTC 


420 


TAGCAAGTCT 


CCTGACCCTA 


GCAATTTTAT 


ATTTGATGTA 


AGAATGGCAC 


CCTGATGTTT 


480 


TAGGGTGTTT 


TTGTGTTTAA 


ATACACAATG 


AAAATCAAAG 


AACAAACTAG 


AAAGCTAACT 


540 


TTAGGCTGCT 


CAAAACATAG 


TATATTGAAA 


CTAAAATAGT 


ACACCTCTAC 


TTCTAAAACA 


600 


TTGTTAGAAA 


TCGATTTGAC 


TGTCCTGATT 


ACGATTTATC 


CT> TTCTTAT 


TTCATTTTAC 


660 


TATACCATCA 


ATAAGTGTTG 


AAAAGATCGT 


TGACAGAACT 


GACCAAGCCA 


GATCATCTTT 


720 


GTAACCACCT 


TGAGCTTGTA 


GCATATTTGT 


TAATTGAGCA 


AAGGCGCTAG 


AAGAGTTTGC 


780 


TGGGTCAGCA 


GTTGCGATTT 


TTCCTTTTAG 


TTCAGGTTTG 


AGAAGGTCGC 


TATATCCTTC 


840 


GATGTTCATG 


CCTTTAGTTA 


AATCAGGGTG 


GACGATTAAA 


ACACTACCAT 


CTAGTGTATA 


900 


AGGAGTAGAG 


AAGCCAGTTG 


TGTTTTGATA 


TTCTTTGATA 


ACATTATCAT 


TTTCTTTTGA 


960 


AATATAGTTT 


TCAAAGAGTT 


CTCCGTGGGT 


AGCATATTGT 


GTTATAAGAA 


CCACCAAAGA 


1020 


TAACATCAGC 


TACAGGAACT 


TCTTTTTCTG 


ACCTAGTTTT 


TTGAAAAGTT 


CTCCAGTACC 


1080 


AG 












1082 



(2) INFORMATION FOR SEQ ID NO:213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1303 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

CTGGACTTGA TAGGCATCTT TGTAATCCTC TAAAGCCTCT TTCATCAAGG CACTACCAAT 60 
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TCCTTGACGC 
ATCTCCAACC 
TAATGAATGA 
GACAGCCTGA 
AATCATCTCA 
AAATTTTCAG 
ATCATAGACA 
GTTCCTTCAT 
TCGCATTTCC 
TACTGGGCTT 
ACTTCTATTA 
GTTGCCTTGT 
ATTGAAACTA 
CCTGATCGAT 
CAGATACCTT 
TCGATTTCTT 
CTATGTTAGT 
CAACACAGAA 
CTATCACTAC 
TATTTTGTCA 
ATATCATTTG 



TGATAGATTG 
AAACGAATCA 
GATAAGGCCT 
TAGACATGTA 
ACACCTCTTA 
GACGCAACCA 
ACCAATCCAT 
AAATAAAGCC 
ACTCATAACG 
CTGTCCCTAT 
CTCCGTTACT 
CTATGTAGTA 
GAATAATATA 
TCGTCCTATT 
ATCTGCTATC 
CTTCCATTGC 
GACTATACTT 
TGAAAAAGTG 
TACAAATCAC 
CATCATTTAA 
CATGCTCTCC 



GTAAAACGAT 
AGCCCACCAC 
GCTCCAGCAT 
AAACATCCTC 
AAGTTCTCTC 
TGTTTCCAAA 
ATCTCTCGTA 
CAAACGCTCC 
ACGATAGTTA 
CCGTGTCCCC 
GGTCAATACG 
AAAACGGTCC 
CCTCTACTTC 
CTTATTTCAT 
CATTTAGGAA 
TTCCAGATAA 
TTCATATTTA 
TTGTGATCTT 
GGGGAGGTGA 
CGGTACATAA 
TTCACCTTTA 



TAAATCCTGA 
AGCATCACCA 
CTCTGGTTGA 
TAGCTTGACA 
AAGCTCTTGT 
CGATCTTTGA 
CGCCCCTTAT 
GCAGCACGTC 
AGCTCTTCAA 
CTGAGTTTTG 
CATGAGAGAA 
TTACGAGCCA 
TAAAATATTG 
TTTAATATAA 
CCCCTAACTT 
TCACTCGTAG 
TAATTCATTC 
TATTTTGTTT 
ATAAGTGAGT 
TAAGTTGTAC 
GCAAAGGTTG 



ACCAATACTG 
TCAAGTGCCA 
TGGGTATAAT 
ATTTCTTGCT 
ACTGCTGTCC 
CTTTGGGCCA 
AAACCACTGC 
TGGATGGCAG 
AGACATAGCG 
GAGAAAAAGT 
AAATTCCCAA 
ACATCTGACT 
TTAGAAATCG 
TTGATAGTGG 
AAGCAATCCC 
GCGAGTACGC 
CTTTCGTTTC 
TATAATAATA 
GGTACAGCCA 
CATCTGAATA 
GAG 
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ATGAGAATCC 120 

CATAAATTAC 180 

TAGTCCAACC 240 

TTTTAATAGT 300 

ATTTTTATCA 360 

GTCCTTATCA 420 

CTGACGGAAG 480 

GTTAAGAGCA 540 

AGCCAGTAGA 600 

GACAGCTCCC 660 

AGCCTTACCA 720 

ACTATAGTAG 780 

ATTTGACTTC 840 

TCGCCCCAGC 900 

CATAATCGTC 960 

AAGCGCTCAT 1020 

ACTCAAGGCA 1080 

GTGAGAAAGC 1140 

CTACCTCGCA 1200 

AGTTGCTACA 1260 
1303 



(2) INFORMATION FOR SEQ ID NO:214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 



CTCTTCAATC 
ACATGGTAGG 
ATTAGGGATA 
CGAAAGTTGG 
AATGAACCAA 
TCGTAAAAAA 
TTTCCAAATC 
GCTAAACCAC 
CTATCTTACT 
TCCAACTACC 



AACTCACGCT 
AACCAAGCAC 
ACTGCATATA 
GTTTGCCATT 
AAATAGTACA 
GTTCTCTCTT 
TCACGTAATA 
CAAGTAAAAG 
GACAATCCAG 
ATCCACTATG 



TTAGGGCTTC 
CATTTGGTTC 
CGCCATAGCG 
GCATTTTCCT 
CAATGTGGTA 
ATTGTGAGCG 
CCATTTATGG 
GAACAAAACC 
ACGCTTATTT 
CACTCAAAGC 



CTGATGATTT 
TTGAACAAGA 
AGCAATATAG 
CATTATCTAG 
TAATCTTTTT 
AACAGGTAGT 
CTGGTTAAAG 
AAGAAAAGTT 
GACTGAAATA 
TATGGGCTAC 



TCACCTGCTT 
ACAATTTGTT 
TCTGTATTCG 
TATCGTTATT 
ATGGCATATT 
ATAACAGAAG 
CTAAAAGAGA 
GATAGAGATA 
GCTTCTGAAT 
ACTCGAAAAA 



CAATTTCTCC 
TTTGTTCAGG 
CTTTTTTCTC 
ATTATAGTGA 
CAATAGATTT 
CATCACACGT 
AAACAGGAGA 
GACTTAAAAA 
TTGGCTGTCA 
AAGAACCACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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CCTACTATGA ACAAGACCCA GAAAAAGTAG 
AGCACCTAGC ACAGGTCTAA CAAATGGTGA 
GATGACGAGC GACTTTTTTG AAGTATGGTT 
ACCATCGGTT ATTATAGTGA AATGAAATAA 
TCGAATTCTA ACAATGTTTT AGAAGTAGAG 
TAATGGACAA TGCAAGATTC CATAGAATGG 
GGCATAAACT TTTACCTTTT CCTCCCTACT 
AGTACATAAC TGCTTCTAAA ACATTCTTAT 
TCAGTTCTTA TTTCATTTTG CTCTACAATC 
AGGTATTGGC AAGTTGCAAT ACCTTTTTAC 
TGACTATATC TCCTATGGTT CTAGTTCAGA 
GTATCATTCC AAGTATTGAG AGTGAATGTT 
CTGGCATTTG CTAGACCGCC ATCTTTACGA 
CGAAGACTGG CAGTAAGATT GGCGCCGTGT 
TCTTTTAATG ATTTGATAAA TTGGATGGTC 
CCGAACATTC GAGTGTCAAA TTGAGCAAGA 
TAAATAG 
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CCTTATTTCT TAAGAATTTT ANTAGTTTAA 660 

ATTAATCGCT CCAATGACTT ACGAAGAGAT 720 

TCAGAAGTTT CTCTTACCAA CATTAACCAC 780 

GAACAGGACN AATCGATCAG GACAGTAAAA 84 0 

GTGTACTATT CTAGTTTCAA TCTACTATAT 900 

GTAAGCTAGA GTTCTTATGT GAAGAGTTTG 960 

CATCTTAGTA TAGAAAAGTG AATCTAAAAT 1020 

AAATTGATTT AAATTCTCAA ATCATATTAT 1080 

CTGTTGAGAA GACACGTGTT CATATCAAAA 1140 

AAGGTTCTTT TGTCTTATTT TTGTTTCAAC 1200 

AGGCTAGGCT ATAATTATGA TTGATAAGAA 1260 

TCAAAATCAT GGGTTTCTAT AATGGTCAGG 1320 

AGAAGTGGTT CTTTATAGCC TAGGAGAGTA 1380 

CCGACAATTA GAATACGCTC AGCTGGACTA 1440 

CGCTGAGTTG TACTATAGAG GGATTCGGCT 1500 

TTTGAACGAA AAGCCTGGAT TTGTTGCGGG 1560 

1567 



(2) INFORMATION FOR SEQ ID NO: 215: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pair -j 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 



CTGTTTCCAT 


AGCATGACTT 


CTGTACTAGA 


CTTTCTTTTC 


CGAATAAATA 


GATAGAACCA 


60 


CAGAATCTAG 


TAAACCTAGA 


ATTAAAATTA 


TGGTATAATA 


TTAGCAATAA 


AAGAAATCTG 


120 


GAGGATTAGA 


ATCATGGTAT 


CAACGAAAAC 


ACAAATTGCT 


GGTTTTGAGT 


TTGACAATTG 


180 


CTTGATGAAT 


GCAGCAGGTG 


TGGCTTGTAT 


GACGATAGAG 


GAGTTAGAAG 


AGGTCAAAAA 


240 


CTCAGCGGCA 


GGAACCTTTG 


TTACTAAGAC 


AGCGACCTTG 


GACTTCCGTC 


AGGGGAATCC 


300 


TGAGCCACGC 


TACCAAGATG 


TTCCACTTGG 


TTCCATCAAC 


TCTATGGGCT 


TGCCAAATAA 


360 


TGGCTTAGAC 


TATTATTTGG 


ATTATCTTTT 


AGATTTGCAG 


GAAAAAGAGT 


CGAACCGAAC 


420 


TTTCTTCTTA 


TCTCTGGTCG 


GCATGTCTCC 


AGAGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


480 


CAGAGA 












486 



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 



CTCTATCAGG 


AAAGTTAAAT 


TAATTTATAG 


AAATATTTTA 


GTATTTAAGC 


CCTACTGTTA 


60 


TAGATTCAAT 


ACACTATATA 


TGTGTTTGTC 


TGATAAAAAT 


TTCTACTCTT 


TTTGATTTTA 


120 


AATAAGTATT 


AGTTTACATT 


ATGGTGTGAA 


TTGGGTTTGA 


TATCTCTTTT 


GAGGAAGTTG 


180 


CCTTAGATTT 


TTCTGATTGT 


GTTTTATTGT 


ACAGTGTATC 


TTGCTTGTTT 


TGAACAGAAT 


240 


TTTTTATGAC 


ATTTGTCATA 


TTTTCTAGTG 


ACAGAAGCTT 


CTGCCTCCTC 


TGATTTTAAA 


300 


AGACTATAAT 


TGTAGTATGA 


AATGGGGGAA GAAGAGATGA GAAATAAAAT 


GATTATCGCA 


360 


GTAAGTTTAG 


TAGTAGCAGG 


AGTTATGACC 


TATCTCATGT 


TTTCGGGATT 


GGATGAGAAT 


420 


TTCTACCATT 


TTCCTTGGGA 


GGTCTTTGCT 


GGCTTTGGAA 


TCATCTTCTT 


GGCTTGTCCA 


480 


GAGAAGGTTT 


GAAATTAGTA 


AGAGATGTGA 


AAAAGGAGTT 


TGAAAAATGA 


AAAAAGCAAC 


540 


TATCTATTTC 


TTTATCGGCC 


TGTCACTCTT 


GGTATGGTTG 


GTAGAAATGT 


TTACAGGTTG 


600 


GTTTGCTCAA 


ACCTTCCTTC 


ACCAGTTCAT 


CCGTGGTGCA 


TGGGGATTAG 


GATTTATGAT 


660 


TTTTATCGCC 


TTTCCGATGG 


GAAAGGAGTT 


GCTGGAAGGA 


GAATATCATG 


AACATGATTA 


720 


AGGTTCAAGG 


ACTACATAAA 


AATATTAAGG 


GCAAGGCTAT 


TTCAAAGGAT 


ATTTCCTTTG 


780 


AAGTAGCAGA 


AGGTGAATGC 


GTTGCCTTGA 


TTGG 






814 



(2) INFORMATION FOR SEQ ID NO: 2 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



CTTGGTACGA 


CCGTATGGGT 


GGTCACTGAA 


AGTGGGAAAC 


TTCCAAGATG 


GGCACTGTGT 


60 


ACGGATCCCC 


GTAAACTGTC 


GCANAAGAAC 


TGAAGATGAT 


GTTTTTACAG 


TTGTTTTCTT 


120 


CCATGGCTTT 


CAAAAGGCTG 


ACAGTTCCAG 


CGATATTGTT 


GTCATAGTAG 


GCAAGAGGGA 


180 


TACGTGTTGA 


TTCGCCAACA 


GCCTTCAAAC 


CAGCAAAGTG 


AATGACACCA 


GTCAGTTCTT 


240 


CCTGCTTGAA AATATCTCTG 


AGGGTATCTG 


TGTCACNAAT 


ATCTGCCTCA 


TAGAAAGGAA 


300 


TCTCAACTCC 


TGTGATTCTT 


TCAACAACTT 


CTAAACTCTT 


ACNATTGCTA 


TTGACAAGAT 


360 


TATCCACCAC 


AACAACTTGA 


TGGCCTGCTT 


GGATCAATTC 


AATAACAGTG 


TGGGTTCCAA 


420 


TAAAACCGGC 


ACCACCAGTT 


ACCAAAATCT 


TTTCTTGCAT 


CTTTTTTCCT 


CGATTCTCAG 


480 


ATTATTTTTT 


CTTATTTTAA 


CCATTTTTGA 


CAGGGAATGT 


CATTTGCCAC 


CTTAAACTAC 


540 
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CTGATAAAAT 


TTCAGTAAAA 


TGCTTATACT 


CTTGGAAAAT 


CCAATTCAAA 


CCACGTCAAC 


600 


GTCGCCTTGC 


CATGGGTATG 


GTTACTGACT 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAGCAG 


660 


CGCTTTGAGT 


AACCCGCGGC 


TAGTTTCNTA 


GTTTGTTCTT 


TGATTTTCAT 


TGAGTATTAT 


720 


TCGCTTTTTA 


CTCGTTTGAC 


ATAGTTTTCA 


ATTGGGTAGT 


TTTCAGGGTC 


CAAGGTCAAC 


780 


TCCTTGTCTT 


GGATCAGTTG 


AGCTAAATGG 


TAACCAATAA 


TAGGACCAGT 


TGTGAGGCCT 


840 


GATGAACCTA 


GTCCACTGGC 


TGCATAGACA 


CCAGTTAAGT 


CAGGCACCTG 


CCCAAAGAAA 


900 


GGAGAGAAAT 


CACTGGTGTA 


GGCACGGATT 


CCAACACGCT 


CAGATTTTGA 


AGTAGCTTCA 


960 


GCCAAAATCA 


GATAGTGAGT 


CAAGGTGGCC 


TCCTCCATTT 


GTTGGAGCAA 


GGTTTCATCT 




ACCGTCAAAT 


CAAATCCCAT 


GTCATTTTCG 


TGGGTAGCGC 


CTAAGGACAA 


TTTCCCACCT 


1080 


GCAAAAGGAA 


TCAAATCCCA 


CTCCCCTTCT 


GGCATGACAA 


CAGGGTAAGC 


TTCCATGTCT 


1140 


TGGACCAGCT 


GATAATCTCG 


TAGTTGTCCT 


TTTTGAGGAC 


GAACATCTAC 


TTCATAACCC 


1200 


AAGGGTTCTA 


ACATGTCCCC 


CAACCAAGCT 


CCNGTCGCCA 


AAATAACCTG 


CTCAAACTCC 


1260 


TCTTCACCAA 


TCTGGTAGCC 


TGATGCTAAC 


GGTGTCAGAG 


TCACTTTTTC 


TTTGACCAG 


1319 



(2) INFORMATION FOR SEQ ID NO: 218: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: 3 i near 

<ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



CTCAAAATAC 


TGTTTTGAGG 


TTGCAGATGG 


AAGCTGACGC 


GGTTTAAAGA 


GATTTTCGAA 


60 


GAGTATAAAC 


TGCTTATAAA 


ATAAAAAGGA 


GCCCTGATGG 


AACACATTAT 


TTATCAGTTT 


120 


GAAGAGGACT 


TGGCAATCCT 


TACCTTGAAC 


CGTCCTGAGG 


TCGCAAATGG 


TTTTCATATT 


180 


CCCATGTGTG 


AGGAGATTTT 


AGAAGCTCTG 


ACTTTGGCAG 


AAGAAAATCC 


AGCTGTGCAT 


240 


TTTATCTTAA 


TCAATGCCAA 


TGGAAAGGTC 


TTCTCAGTTG 


GGGGAGATTT 


GGTAGAGATG 


300 


AAGCGGGCAG 


TGGATGAGGA 


TGATATTCCA 


TCATTGACAA 


AAATCGCAGA 


GTTGGTCAAT 


360 


ACGATTTCTT 


ATAAAATCAA 


GCAAATAGCC 


AAACCTGTTT 


TTAATGGAAG 


TTGACGGTGC 


420 


TGTTGCAGGT 


GCCGCAGCGA 


ATATGGCTGT 


TGCGGCAGAT 


TTCTGTTTGG 


CGACGGATAA 


480 


GGCTAAGTTT 


ATCCCAAGCC 


CTTTGTTGGT 


GTGGTTGGCT 


CCAGATTCCA 


GGGG 


534 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: G nomic DNA 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 219: 



CTGCAGCCGG AAGTTATCTG GTTACAAAAA ATCGCCAAAC CATCACAGAT GAAAGTCTTG 60 

AATACCACTG ACCGCGTTCA AGCTATCAAG GACGATGTGG ATATTATCCA AAACAGCCTG 120 

CAAATCATTA ACCAGCAAAA AGAACTTATC AAGGAATACC AAGAAGACTT GACTTACAAG 180 

TTTAAGGTCT TGGAAAAGGA TATCCAAACT AGGACTAGCT GTGATAAAAG AAATGCAGGG 240 

AACTGAAGAT AAGTAAAAAG AGCCCGACGG CTCTTTTTTA AAACGTAAAC AAGTTATAGT 300 

GAATTGAATT TAGAATAGCA CATCATAGTT TCTAAAGCAT TGTTAGAGAT TACTTTAAAA 360 

TCCTCTTATC AATTTGTTCA TATTCTATTT CAATCTACTA TATATATCCC ATCAACTATG 420 

CATCATAATT TAGGTAACTC ATACTCAATA AAAATCAAAA AGCAAACTAG AAAGCTAGGC 480 

ACAGACTGCT CAAAACACCG TTTTAAGGTT GTGGATAGAA CTGACGAAGT CAGTAACCAT 540 

ACCCATGACA AGCCGACGCT GACGTGGTTT GAAGAAATTT TTGAAGAGTA TCATTCACCA 600 

TTTCACAACT AAATCGAAGA CTTTCTGCCG TAATGAAAAC ACCTGAAACA GCTTGGTTTC 660 

AGCTGTCCGG AAACTTTAAG ACTTAGGTTC AAAGTTTAGG TATGGAACTT CGAAGGAGGT 720 

CGCTACCGTC CGTCATTACT TAGNGAAGNC TTAAAAAATC TATAAAGTAA AAAGAGCCCG 780 

ACAG 784 



(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 



CTACTCGTTT TAGTATAGTA AACTGAATCT AGACTAGCAC AATGCNGCTT CTANAGCGTT 60 

TCGAAACTAG AATAGGACAA CATAACTGCT AAAAGATTTC TATAAATTCA TTTGATTTTC 120 

CTAATCAATT TGTTCGTATC CTANTTCACT CCACTATAAA ATATTCTTAT CAATTGATTT 180 

GNATGCCAAA ATTCCATCGT TCAGGTCTTA TTTCATTNGG CTNTAAAAAA CTCTCCTGGG 240 

NAATCCCAGG AGAGNATGAT TACNTATTTG AACTTGAATC CTTCGTANAT AAGCTCTGTT 300 

TTTGGATTTT GTTTCTTAAT CTGTTTGGCA AGTGCCTTCA TCATAGAAAT AGGACCACAC 360 

ATATAGACGG TTGCATGTTC GGGCACTTTT TTTGTTCAAA ATTAAGATAG CCGTCTTTCG 420 
TACTGTCGAT TAGATGGAGT TCAAAATTAG GATTTTTCTG AGCATAGTTA CGGAGTAAAT 
CTAGGTAGAC TGCATTTTCA TCTCCACGGA AGCTATAGTA GAAGTGAACC TGTTTATCTA 
AAATAGGATG TTCACGGATG TAAGAGATGA AGGGGGTGAT CCCAATACCT CCAGCAATCC 

AAACCTGATT TTCTCGTCCT TCTTCTATGA TCATGTGTCC GTAAGCTCTG TCTAGGGTTA 660 

CTTTGCTGCC GGCTTGAAGA TTATCATAGA TATTTCTTGG TATGGTCGCC TGAATTTTTA 720 
ACAGTAAAGT TAAAGAGTTT GAACCCATGA ACTCCCTGAG AATAGAAAAG GGATGCCGGA 



480 
540 
600 



780 
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ACCACTTTCA AAGCCTTCTT GGAAAATCTT 
GAAAGGTCTG CTAAGATGGA TTTGAATTTC 
GGTAATTTTC CCTAGATAGG GGAAGGAAAT 
AG 



PCIYUS97/05306 

TAGAAAGGCA AATTGTCCTG ATTGATAGTT 840 
TCTAGTATCG TGATTTAAGC GTTTGAGATG 900 
CTTTTGATAT AGAAAAATGA TATAAAAACC 96C 

962 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 910 base pairs 
tB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CTCTTTCCAC 


TATTATAACA 


CGAAAGAAAG 


AATTGTCAGG 


GAACTGTACA 


GATTTCTTTT 


60 


CTATCTATTT 


ATAACTCAAT 


GAAAATCAAA 


GAGCAAACTA 


GGAAACTAGC 


CGCAGGCTGT 


120 


AATTGAGTAC 


GGCAAGGCGA 


CGTTGACGCG 


ATTTGAATTT 


GATTTTCGCA 


GAGTATTATT 


180 


CGTAAAAAAT 


CTCAAAAAGC 


CTACCTTTCG 


GTAGACTTAG 


TTTGTTTCTA 


TTCTAATCGG 


240 


CACTCTTCC . 


AAATTTTGCT 


CTGCTATACT 


TGGCTTTCCT 


AGTTGGTAAA 


TCTGGTCAAC 


300 


CTTTTGAGTC 


ATAGCATCCC 


AAGGTTCTTT 


GCCAATTCGG 


CTGACTAGAT 


TGACCTGTCC 


360 


TTTCAGAGAC 


TTGAGATGTT 


GCCTGCCTTT 


TTCAGTAAAT 


CCAAGGACAT 


GAATGGCTTC 


420 


TGGCAAGTCA 


CTTTCTCTAG 


CCTGCATCAA 


AATATAGGTC 


AATAGGCGTC 


TGACACGCGC 


480 


CTTGGTGTAA 


CGTTTGGTAG 


TCACTAACTC 


GACCAATTCT 


TCCACAGACT 


GGGCTGTTTT 


540 


AATAGCTTCT 


TTAATGCGCA 


CAGCCATTTC 


TTGATTGACC 


TGATAGATGG 


TGGTTAGGTC 


600 


GGGATTTGAC 


AAGATTTGAT 


AGCGGAGCAA 


GGGAAAATAG 


TCTTCCCAGA 


TCACCTTACT 


66C 


GGCTTGCTCA 


AATAGGGCAA 


CAGAAGGCAT 


AAAACGTTCT 


AAGAAATCTT 


GGTCCTTCTG 


720 


ATGTTGACGG 


AGGGCTGTCG 


CCGAGGCAAA 


GTCCACATCT 


TTATTCACAG 


AATGGTAACC 


780 


TGCCCCCTGA 


CGCTGAATCG 


GATGCAGCTT 


GATATTCCGT 


CCTGCAACCG 


CCTTGGCATA 


840 


GGCCAAAACA AAGAACATGA 


TTGGGGTGTA 


TTACCTGAAA 


AATCAAGACC 


AGCAAATTCC 


900 


TTCCACATAG 












910 



(2) INFORMATION FOR SEQ ID NO: 2 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
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CGCCAAGTCA TCCAACAAGT CCAGAACGGT TCTGGGGCTT GGGATCCTCA GTCGCTGGAG 60 

ATAACTCCTT TGGGCTTGTT CATCATGTAG TAGACAAACT CTTCATACTC CAACACTTGC 120 

CCATCAAAAG CGAATCTCAT CTATTTTTTC ATCAATCTGC AATTTAGCTG ATTTTTCTTT 180 

TTTACCATTT ACAGTCACGC GCCCAGCCTT GAGCAAGTTT TTGACCTCAG TCCGACTTCC 240 

CACCGCACAG GCAACTAAAA ATTTATCTAA TCTCATAGAA CTATTATATC ATATCAAAAG 300 

GAGGCTGGTA CAATGACCAA CCTCCTTTTC GTTTCATACT CTTCAAAAAT CTCTTCAAAC 360 

CGCGTCAACG TCGCCTTGCC GTATATATGT TACTGACTTC GTCAGTTCTA TCTGCAACCT 420 

CAAAGCAGTG CTTTGAGCAA CCTGCGGCTA GTTTCCTAGT TTGCTCTTTG ATTTTCATTG 480 

AGTATCAGAT TTAGGAAATT AACTTCCTCG TCTCCAAAAA ATAGCTAAGA CAATCATGGC 540 

ACCTAAAACA GCTGGGATAA TAGCTGTTCC TGATAAAACT GGCCCCCAAG TTCCAAAGAG 600 

CAAGTGACCT AGAAAGGCTC CGATCCAACC GAGAAACATT TTTCCAAAAC ATCCCATTCG 660 

CTCTCCACGA TTGGTCATAG CACCTGCTAA AAATCCCACT AGGAGACCAA CGAACATACT 720 

TCCTAACATA TTATCTCCTT AATTTGCCCA ATTCCCATTA CGGAAAAGAA GTACTCGCGT 780 

TCCATCCTCA CGAATACCAT CGATATCCAT TTGGTTAGAA CCAATCATAA AGTATACGTG 840 

AACATCTGAA CGGTTAAGCC CTGCAG 866 



(2) INFORMATION FOR SEQ ID NO: 223: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 693 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 223: 



60 



CTTCAGGATG TGGGAAAACG ACCCTTCTTC GTATGATTGC AGGTTTCAAC AGTATCAAAG 

ATGGAGAATT TTACTTCGAT GATACAAAAA TCAATAATAT GGAACCCAGC AAACGCAATA 120 

TCGGGCTGGT TTTCCAAAAC TACGCTATTT TCCCACATTT GACTGTCCGA GACAACGTTG 180 

CTTTTGGTCT TATGCAAAAG AAGGTTCCAA AAGAAGAATT GATTCAACAG ACCAACAAGT 240 

ATCTTGGACT CATGCAAATT GCTCAATATG CGGATCGAAA GCCCGATAAA CTCAGTGGTG 300 

GACAACAACA ACGTGTCACC TTGGCATGCG TCTTAGCGGT TAATCCAAGT GTTCTCCTCC 360 

ATGGACGAAG CCACTTAGTA ATCTGGAGGC CAAACTTCGC TTGGATATGC GTTCAAGCCC 420 

ATCCCGAGAA ATCCAACCAC GAAGTTGGGG AATTACAACT GTTTATGTAA CCCACGACCA 480 

AGAAGGAGCC ATGGCTATTT CAGACCAAAT TGCCTGTTAT GAAAGATGGG GTGATCCAAC 540 

AAATCGGCCG ACCAAAAGAA CTGTATCATA AACCAGCTAA TGAGTTAGTG GCAACCTTTA 600 
TCGGACGCAC AAATATTATC CCTGCCAATC TTGAAAAACG GAGCGACGGC GNTTATATCG 
TCTNTTCAGA TGGANANGCC CTTCGAATGA TAG 



660 
693 



(2) INFORMATION FOR SEQ ID NO: 224: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 : 



Met Ser Met Phe Thr 

1 5 
Cys Asn Leu Lys Gly 

20 

Ser Gly Lys Ser Thr 
35 

lie Asp He Ala Lys 
50 

Val Asn Arg Pro Glu 
€5 

Met Val Phe Gin Arg 
85 

He Thr Phe Ala His 
100 

Asp Glu He Val Glu 
115 

Val Lys Asp Asp Leu 
130 

Gin Gin Arg Leu Cys 
145 

Leu Leu Met Asp Glu 
165 

Ala Thr Arg Arg Asp 
180 



Met Val Lys Met Asn Pro 
10 

Asn Lys He Thr Ala Leu 
25 

Tyr Leu Arg Ser Leu Asn 
40 

Val Thr Gly Gin He Leu 
55 

He Asn Val Tyr Glu Met 
70 75 
Pro Asn Pro Phe Ala Lys 
90 

Glu Arg Ala Gly Val Lys 
105 

Thr Ser Leu Ser Gin Ala 
120 

His Lys Ser Ala Leu Thr 
135 

He Ala Arg Ala He Ser 
150 155 
Pro Ser Leu Ser Leu Gly 
170 

His Val 



Leu Arg Gly Leu He 
15 

He Gly Pro Ser Gly 
30 

Arg Met Asn Asp Thr 
45 

Tyr Arg Gly lie Asp 
60 

Arg Lys His He Gly 
80 

Ser He Tyr Arg Asn 
95 

Asp Lys Gin Val Leu 
110 

Ala Leu Trp Asp Gin 
125 

Leu Ser Gly Gly Gin 
140 

Val Lys Pro Asp He 
160 

Ser Asp Cys Asp His 
175 



(2) INFORMATION FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 5: 

Met He Lys He Ser Asn Leu Ser Lys Ser Phe Ser Gly Gin Thr Val 

15 10 15 

Leu Asp His Leu Asn Leu Asp He Gin Lys Gly Glu Val Val Ala Leu 

20 25 30 

He Gly Ser Ser Gly Ala Gly Lys Ser Thr Phe Leu Arg Ser Leu Asn 

35 40 45 

Tyr Leu Glu Thr Pro Asp Ser Gly Ser He Gin He Asp Gly Phe Ser 

50 55 60 

Val Asp Phe Ser Lys He Thr Gin Glu Glu He Leu Ala Leu Arg Arg 
65 70 . 75 eo 

Lys Leu Ser Met Val Phe Gin Gin Phe Asn Leu Phe Glu Arg Arg Thr 

85 90 95 

Ala Leu Asp Asn Val Lys Glu Gly Leu Val Val Val Lys Lys Leu Ser 

100 105 110 

Asp Gin Glu Ala Thr Lys He Ala Lys Glu Glu Leu Ala Lys Val Gly 

115 120 125 

Leu Ser Asp Arg Glu Asn His Tyr Pro Arg His Leu Ser Gly Gly Gin 

130 135 140 

Lys Gin Arg Val Ala Leu Ala Arg Ala Leu Ala Met Lys Pro Asp Val 
145 150 155 160 

Leu Leu Leu Asp Glu Pro Thr Ser Ala Leu Asp Pro Glu Leu Val Gly 

165 170 175 

Glu Val Glu Lys Ser He Ala Asp Ala Ala Lys Ser Gly Gin Thr Met 

180 185 190 

He Leu Val Ser His Asp Met Pro Phe Val Ala Gin Val Ala Asp Lys 

195 200 205 

He Leu Phe Leu Asp Lys Gly Lys He He Glu Ser Gly Thr Pro Asp 

21° 215 220 

Glu He He His Thr Pro Lys Glu Glu Arg Thr Lys Glu Phe Phe Thr 

225 230 235 240 

Ser Tyr Lys Arg Thr Tyr lie 

245 



(2) INFORMATION FOR SEQ ID NO: 22 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

Met Phe Ser Leu Arg Ser Val Phe Asp Gly lie Pro Arg lie Val Gin 

15 10 15 

Gin Leu Pro Thr Thr lie Met Leu Thr lie Gly Gly Ala Leu Phe Gly 

20 25 30 

Leu Val Leu Ala Leu Leu Phe Ala lie Val Lys lie Asn Arg Val Lys 

35 40 45 

lie Leu Tyr Pro Leu Gin Ala Phe Phe Val Ser Phe Leu Lys Gly Thr 

50 55 60 

Pro lie Leu Val Gin Leu Met Leu Thr Tyr Tyr Gly lie Pro Leu Ala 
65 70 75 80 

Leu Lys Ala Leu Asn Gin Gin Trp Gly Thr Gly Leu Asn lie Asn Ala 

85 90 95 

He Pro Ala Ala Ala Phe Ala He Val Ala Phe Ala Phe Asn Glu Ala 

100 105 110 

Ala Tyr Ala Ser Glu Thr He Arg Ala Ala He Leu Ser Val Asn Pro 

115 120 125 

Gly Glu He Glu Ala Ala Arg Ser Leu Gly Met Thr Arg Ala Gin Val 

130 135 140 

Tyr Arg Arg Val He He Pro Asn Ala Ala Val Val Ala Thr Pro Thr 
145 150 155 160 

Leu He Asn Ser Leu He Gly Leu Thr Lys Gly Thr Ser Leu Ala Phe 

165 170 175 

Ser Ala Gly Val Val Glu Val Phe Ala Gin Ala Gin He Leu Gly Gly 

180 185 190 

Ala Asp Tyr Arg Tyr Phe Glu Arg Phe He Ser Val Ala Leu Val Tyr 

195 200 205 

Trp Val Val Asn He Gly lie Glu Ser Leu Gly Arg Phe lie Glu Arg 

210 215 220 

Lys Met Ala lie Ser Ala Pro Asp Thr Val Gin Thr Asp Val Lys Gly 
225 230 235 .240 

Asp Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

188 



WO 97/37026 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

Met Tyr Arg He Asp Asp Asp Glu Gin Phe Val Leu Asp Phe Leu Lys 

15 io 15 

Gin Glu Lys Val Leu Leu Val His Gly Arg Gly Phe Asn Trp Gin Glu 

20 25 30 

Pro Asp His Phe Arg He Val Tyr Leu Pro Arg Val Asp Glu Leu Ala 

35 40 45 

Gin He Gin Glu Lys Met Thr Arg Phe Leu Lys Gin Tyr Arg Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: liner. - 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

Met Asn Asp Glu Ala Ser Lys Gin Leu Thr Asp Ala Arg Phe Lys Arg 

15 10 15 

Leu Val Gly Val Gin Arg Thr Thr Phe Glu Glu He Leu Ala Val Leu 

20 25 30 

Lys Thr Ala Tyr Gin Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys 

35 40 45 

Leu Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gin Tyr Val Arg Glu 

50 55 ' 6 o 

Tyr Arg Thr Tyr Glu Gin He Ala Ala Asp Phe Gly He His Glu Ser 
65 70 75 80 

Asn Leu He Arg Arg Ser Gin Trp Val Glu Val Thr Leu Val Gin Ser 

85 90 95 

Gly Val Thr lie Ser Arg Thr Pro Leu Ser Ser Glu Asp Thr Val Met 

100 105 110 

lie Asp Ala Thr Glu Val Gin He Asn Arg Pro Lys Lys Arg He Ser 
115 120 125 
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Glu Ser Phe Trp 
130 
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(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 229: 

Met Arg Arg Lys Tyr Lys Ser lie Ala Leu Lys Lys Glu Leu Ala Asn 

15 10 15 

His Ser Gly Lys Lys Lys Phe His Ala Met Lys Ala Gin Ala lie Val 

20 25 30 

Thr Ser Gin Gly Arg lie Val Ser Leu Asp lie Ala Val Asn Tyr Ser 

35 40 45 

His Asp Met Lys Leu Phe Lys Met Ser Cys Arg Asn lie Gly Gin Ala 

50 55 60 

Gly Lys lie Leu Ala Asp Ser Gly Tyr Gin Gly Pro Met Lys lie Tyr 
65 70 75 80 

Pro Gin Ala Gin Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu lie 

85 90 95 

Ala Glu Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys 

100 105 110 

Val Glu Asn lie Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr 

115 120 125 

Thr Tyr Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu lie 

130 135 140 

Ala Gly lie lie Asn Tyr Glu Leu Gly Phe 
145 150 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
{B) TYPE: amino acid 
<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

Met Val Ser Ser Ser Gly Ser Glu Phe Gin Ser Gly Trp Gin Glu His 

15 10 15 

Gin Leu lie Ala Glu Lys Val Ser Lys Thr Leu Asp Lys Thr Phe Asp 

20 25 30 

Lys Asp Val Arg Lys lie Pro Thr Ser Pro Val Leu Ser Lys lie Cys 
35 40 45 

Arg 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOJ >CULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Met He Lys He Leu Ala Ala Cys Gly Ala Gly Val Asn Ser Ser His 

1 5 10 15 

Gin He Lys Ser Ala Leu Glu Glu Glu Leu Ser Asn Arg Gly Tyr Asp 

20 25 30 

Val His Cys Asp Ala Val Met Val Lys Asp Val Asn Glu Asp Leu Met 

35 40 45 

Lys Gly Tyr Asp He Phe Thr Pro lie Ala Ala Thr Asp Leu Gly Phe 

50 55 60 

Glu Pro Gly He Pro Val He Glu Ala Gly Pro He Leu Phe Arg He 
65 70 75 80 

Pro Ala Met Ser Ala Pro Val Phe Asp Asn He Arg Leu Pro Ala Lys 
85 90 95 

Gin Asn Met Val 
100 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



PCT/US97/05306 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: 

Met Asn Thr Lys Met Met Ser Gin Phe Ser Val Met Asp Asn Glu Met 

15 10 15 

Leu Ala Cys Val Glu Gly Gly Asp lie Asp Trp Gly Arg Glu lie Ser 

20 25 30 

Cys Ala Ala Gly Val Ala Tyr Gly Ala lie Asp Gly Cys Ala Thr Thr 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

Met Arg Glu Lys Glu lie Phe Asp Ser lie Val Thr He He Gin Glu 

15 10 15 

Arg Gin Gly Glu Asp Phe Val Val Thr Glu Ser Leu Ser Leu Lys Asp 

20 25 30 

Asp Leu Asp Ala Asp Ser Val Asp Leu Met Glu Phe He Leu Thr Leu 

35 40 45 

Glu Asp Glu Phe Ser He Glu He Ser Asp Glu Glu He Asp Gin Leu 

50 55 60 

Gin Ser Val Gly Asp Val Val Lys He He Gin Gly Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 40 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

Met Ala His Gly Asp Leu Leu Tyr His Asp Gly Leu Phe Phe Ser Ala 

Lys Lys Glu Asp Gly Thr Tyr Asp Phe His Glu Asn Phe Glu Tyr val 

20 25 30 

Thr Pro Trp Leu Lys Gin Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Met Ser Leu He Thr His Arg Arg Phe He Ser Cys Asn Glu Asn lie 
15 io 15 

Lys His Tyr Lys Arg Leu He Asp Lys Ala Glu Lys Cys Val Asn Asp 

20 25 30 

Leu Met Ala Glu Leu Asn Ser Val He Thr Thr Val Thr Gly He Glu 

35 40 45 

Asn Arg Leu Gly Ala Val He Leu Ala Glu He Arg Asn He His Ala 

50 55 60 

Phe Asp Asn Pro Ala Gin Leu Gin Ala Phe Ala Gly Leu Asp Ser Ser 
65 70 75 80 

He Tyr Gin Ser Gly Gin He Asp Leu Val Gly Arg Met Val Lys Arg 

85 90 95 

Gly Ser Leu His Leu Arg 

100 

(2) INFORMATION FOR SEQ ID NO: 23 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 6: 

Met lie Arg Ala Val Phe Phe Ser He Asp Asn Asp Met Glu Glu Ala 

15 10 15 

Ala Arg Ser Met Gly Ala Ser Ser Phe Tyr Thr Met Val Arg Val He 

20 25 30 

He Pro Tyr He Leu Pro Val Val Leu Ser Val Val Val Leu Asn Phe 

35 40 45 

Asn Ser Leu Leu Ser Asp Tyr Asp Leu Ser Val Phe Leu Tyr His Pro 

50 55 60 

Leu Phe Gin Pro Leu Gly He Val He Lys Gin Ser Thr Asp Glu Thr 
65 70 75 80 

Ala Thr Leu Asn Ala Gin Ala Met Met Phe Val Tyr Ser Val He Leu 

85 90 95 

Met He Met Ser Ser Ala Ala Leu Tyr Leu Ser Ser Leu Phe Gin Gly 

100 105 110 

Lys Arg Gly Lys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 237: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala He Pro Ala Ser 

15 10 15 

Thr Gin Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu 
20 25 30 
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He He Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gin Val Gin 

35 40 45 

Thr Met Leu Glu Gin Ala Phe Thr Glu Lys His Tyr Glu Asn Thr He 

50 55 60 

Leu His Ser Asp Gin Gly Trp Gin Tyr Gin His Asp Ser Tyr His Arg 
65 70 75 eo 

Phe Leu Arg Val Arg Glu Phe Lys His Leu Cys His Ala Arg Glu Thr 



95 



85 90 
Ala Lys Thr Thr Val 
100 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 8: 

Met Glu lie Leu Thr Thr Gin Gin Val Arg Glu Gly Tyr Gin Ala He 

1 5 io 15 

Leu Ala ser Pro Leu Gly Leu Gin Asp Ala Phe Glu Val Ala Gin Glu 

20 25 30 

Lys Ser Gly Ser Tyr Thr Val Pro Pro Glu He Asp Gly Trp Lys Gly 

35 40 45 

Asn Thr Glu Pro Leu Arg He Asp Tyr Val Phe Thr Thr Lys Glu Leu 

50 55 €0 

Ala Val Glu Asn Leu His Val Val Phe Asp Gly Asn Lys Ser Pro Gin 



65 70 75 

Val Ser Asp His Tyr Gly Leu Asn Ala Met Leu Asn Trp Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



80 
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(ii) MOLECULE TYPE : protein 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 : 

Met Leu Lys His Leu Asn Leu Lys Gly His Leu Leu Thr Ala He Ser 

15 10 15 

Tyr Met He Pro He Val Cys Gly Ala Gly Phe Leu Val Ala He Gly 

20 25 30 

Leu Ala Met Gly Gly Gly Val Pro Asp Ala Leu Val Ala Gly Lys Phe 

35 40 45 

Thr He Trp Asp Ala Leu Ala Thr Met Gly Gly Lys Ala Leu Gly Leu 

50 55 60 

Leu Pro Val Val He Ala Thr Gly Leu Ser Tyr Ser He Ala Gly Lys 
65 70 75 80 

Pro Gly He Ala Pro Gly Phe Val Val Gly Leu He Ala Asn Ser Val 

85 90 95 

Gly Ser Gly Phe He Gly Gly He Leu Gly Gly Tyr He Val Gly Phe 

100 105 110 

Leu Val Gin Ala lie lie Lys Lys Val Lys Val Pro Asn Trp lie Lys 

115 120 125 

Gly Leu Met Pro Thr Leu lie lie Pro Phe Val Pro Leu Trp 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

Met He Ala Leu Gly Trp Ser Asn lie Gly Ala Ala He Ala Pro Asp 

15 10 15 

Ala Ala Leu Ala Ser Val Ala Ala Ala lie lie Met Val Leu Gly Gly 

20 25 30 

Asp Phe Thr Lys Thr Gly lie Gly Val Ala Gin Ala Val Ala lie Pro 

35 40 45 

Leu Ala Val Ala Gly Leu Phe Leu Thr Met lie Val Arg Thr lie Ser 
50 55 60 
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Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe 
65 70 75 80 

Gly Ala Val Glu Arg Ala His Phe lie Ala Leu Leu Phe Gin Gly Leu 

85 90 95 

Arg lie Ala Leu Pro Ala Ala Leu Leu Leu Met Val Pro Thr Glu Thr 

100 105 no 

Val Gin Ser lie Leu Ser Ala Met Pro Asp Trp Leu Lys Asp Gly Met 

115 120 125 

Ala He Gly Gly Gly Met Val Val Ala Val Gly Tyr Ala Met Val He 

130 135 140 

Asn Met Met Ala Thr Arg Glu Val Trp Pro Phe Phe Ala Leu Gly Phe 
145 150 155 160 

Val Leu Ala Ala Val Ser Asp He Thr Leu He Gly Phe Gly Ala He 

165 170 175 

Gly Val Ala He Ala Leu He Tyr Leu His Leu Ser Lys Thr Gly Gly 

180 185 190 

Asn Gly Gly Gly Gly Ala Ala Thr Ser Asn Asp Pro He Gly Asp He 

195 200 205 

Leu Glu Asp Tyr 
210 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:241: 

Met Leu Asp Val Glu Ala He Arg Lys Asp Phe Pro He Leu Asp Gin 

1 5 10 15. 

He Val Asn Asp Glu Pro Leu Val Tyr Leu Asp Asn Ala Ala Thr Thr 

20 25 30 

Gin Lys Pro Leu Val Val Leu Lys Ala lie Asn Ser Tyr Tyr Glu Gin 

35 40 45 

Asp Asn Ala Asn Val His Arg Gly Val His Thr Leu Ala Glu Arg Ala 

50 55 60 

Thr Ala Ser Leu 
65 
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(2) INFORMATION FOR SEQ ID NO: 242: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

Met Thr Lys Leu Leu Asn Lys Lys Trp Lys Val Lys He Met Lys Gin 

1 5 10 15 

He Leu Leu Val Cys Asn Ala Gly Met Ser Thr Ser Met Leu Val Lys 

20 25 30 

Lys Met Gin Gin Ser Ala Thr Glu Arg Gly He Glu He Ser He Gin 

35 40 45 

Ala Lys Ser Met Thr Glu Ala Lys Lys Asn He His Glu Ala Asp Val 

50 55 60 

He Leu He Gly Pro Gin He Arg Tyr Glu Leu Leu Ala Val Lys Glu 
65 70 75 80 

lie Ala Gly Asn He Pro Val Asp Thr lie Asp Met Arg Asp Tyr Gly 

85 90 95 

Met Met Asn Gly Ala Lys Val Leu Glu Gin Ala Leu Glu Trp lie Gly 
100 105 HO 

Glu lie Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 243: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

Met Val Leu Met Gin Lys Met His Gly Lys Glu Leu lie Thr Leu Asn 
15 10 15 
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Gin Thr Val Lys Trp Tyr Lys Val 
20 

Lys Pro Trp Tyr Tyr Leu Lys Ser 

35 40 
Trp Gin Gly Asn Tyr Tyr Leu Lys 

50 55 
Glu Trp Val Tyr Asp Ala Thr Tyr 
65 70 
Asp Gly Ser Tyr Ala Tyr Ser Thr 
85 

lie Gly Trp 



PCT/US97/05306 

Ser Gly Phe Met lie Leu Leu Thr 
25 30 
Asp Gly Ser Tyr Ala Arg Asn Ala 
45 

Ser Asp Gly Lys Met Ala Val Asn 
60 

Gin Ala Trp Tyr Tyr Leu Thr Ser 

75 80 
Trp Gin Gly Asn Tyr Tyr Pro Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

Met Leu Thr Val His Gly Lys Glu lie Thr He Leu Lys Ser Asp Gly 

1 5 io 15 

Lys Met Ala Val Asn Glu Trp Val Asp Gly Gly Arg Tyr Tyr Val Gly 

20 25 30 

Ala Asp Gly Val Trp Lys Glu Gly Gin Ala Ser Thr Ala Ser Pro Ser 

35 40 45 

Asn Asp Ser Asn Ser Glu Tyr Ser Cys Cys Phe Arg Lys Gly Lys Lys 
50 55 60 

Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 245: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 225 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: prot in 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn 

1 5 10 15 

Gin Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr 

20 25 30 

Leu lie Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg 

35 40 45 

Thr Thr Gly Gin Leu Pro Gly Asn Ala Glu Thr Asp Val Gin Met Pro 

50 55 60 

Gin Gin Ala Ser Gly Met Lys Gin Asp Gly Val Leu Ala Lys Leu Gly 
65 70 75 80 

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val He 

85 90 95 

Gly Arg Asn Lys Glu He Gin Glu Thr Ser Glu He Leu Ser Arg Arg 

100 105 HO 

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr 

115 120 125 

Ala Val Val Olu Gly Leu Ala Gin Ala He Val Asn Gly Asp Val Pro 

130 135 140 

Ala Ala He Lys Asn Lys Glu He He Ser He Asp He Ser Gly Leu 
145 150 155 160 

Glu Ala Gly Thr Gin Tyr Arg Gly Ser Phe Glu Glu Asn Val Gin Asn 

165 170 175 

Leu Val Asn Glu Val Lys Glu Ala Gly Asn He He Leu Phe Phe Asp 

180 185 190 

Glu He His Gin He Leu Gly Ala Gly Ser Thr Cys Gly Asp Ser Gly 

195 200 205 

Ser Lys Gly Leu Ala Asp He Leu Ser Gin Ser He Ser Leu Val Glu 
210 215 220 

Asn 

225 

(2) INFORMATION FOR SEQ ID NO: 246: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 



Met Thr He Phe Pro Asp Asp Phe Leu Trp Gly Gly Ala Val Ala Ala 

15 10 is 

Asn Gin Val Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val 

20 25 30 

Gin Asp Val Leu Pro Lys Gly Gly Leu Gly Glu Ala Thr Glu Asn Pro 

35 40 45 

Thr Glu Asp Asn Leu Lys Leu He Gly He Asp Phe Tyr His Lys Tyr 

50 55 60 

Lys Glu Asp He Ser Leu Phe Ser Glu Met Gly Phe Asn Val Phe Arg 
65 70 75 80 

Thr Ser He Ala Trp Ser Arg He Phe Pro Lys Gly Asp Glu Glu Glu 

85 90 95 

Pro Asn Glu Ala Gly Leu Lys Tyr Tyr Asp Glu Leu Phe Asp Glu Leu 

100 105 no 

His Ala His Gly lie Glu Pro Leu Val Thr Leu Ser His Tyr Glu Thr 

115 120 125 

Pro Leu Tyr Leu Ala Arg Lys Tyr His Gly Trp He Asp Arg Lys Asn 

130 135 140 

Asp Ser Phe Leu 
145 



(2) INFORMATION FOR SEQ ID NO:247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

Met He Glu Tyr Lys Asn Val Ala Leu Arg Tyr Thr Glu Lys Asp Val 

15 io 15 

Leu Arg Asp Val Asn Leu Gin He Glu Asp Gly Glu Phe Met Val Leu 

20 25 30 

VaL Gly Pro Ser Gly Ser Gly Lys Thr Thr Met Leu Lys Met He Asn 

35 40 45 

Arg Leu Leu Glu Pro Thr Asp Gly Asn He Tyr Met Asp Gly Lys Arg 
50 55 60 
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lie Lys Asp Tyr Asp Glu Arg Glu Leu Arg Leu Ser Thr Gly Tyr Val 
65 70 75 80 

Leu Gin Ala lie Ala Leu Phe Pro Asn Leu Thr Val Ala Glu Asn lie 

85 90 95 

Ala Leu He Pro Glu Met Lys Gly Trp Ser Lys Glu Glu He Thr Lys 

100 105 110 

Lys Thr Glu Glu Leu Leu Ala Lys Val Gly Leu Pro Val Ala Glu Tyr 

115 120 125 

Gly His Arg Leu Pro Ser Glu Leu Ser Gly Gly Glu Gin Gin Arg Val 

130 135 140 

Gly He Val Arg Ala Met He Gly Gin Pro Lys He Phe Leu Met Asp 
145 150 155 160 

Glu Pro Phe Ser Ala Leu Asp Ala He Ser Arg Lys Gin Leu Gin Val 

165 170 175 

Leu Thr Lys Glu Leu His Lys Glu Phe Gly Met Thr Thr He Phe Val 

180 185 190 

Thr His Asp Thr Asp Glu Ala Leu Lys Leu Ala Asp Arg He Ala Val 

195 200 205 

Leu Gin Asp Gly Glu He Arg Gin Val Ala Asn Pro Glu Thr He Leu 

210 215 220 

Lys Val Pro Ala Thr Asp Phe Val Ala Asp Leu Phe Gly Gly Ser Val 
225 230 235 240 

His Asp 



(2) INFORMATION FOR SEQ ID NO; 24 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:248: 

Met Pro Phe Val Pro Gly He Ala Leu Thr Asn Ala Val Arg Asp He 

15 10 15 

Met Thr Asn His He Asn Ser Gly Met Ser Lys Met Phe Glu Ser Leu 

20 25 30 

Leu He Thr Leu Ala Leu Gly Ala Gly Thr Ser Val Ala Leu Val Leu 
35 40 45 
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Met Asn 
50 
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(2) INFORMATION FOR SEQ ID NO: 249 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 



Met Tyr Thr lie Arg Met Leu Lys 

1 5 
Ser Ala Gin Glu His Gly Leu Lys 
20 

Pro Gly Ser Gly Arg Glu Ser Ala 
35 40 

Ser 



Met Gly Ser Glu Ala Ala Ala Lys 

10 15 
Ser Val Glu Val Thr Val Lys Gly 
25 30 
lie Ser Cys Ala Cys Cys Arg Trp 
45 



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Met He Glu Phe Glu Lys Pro Asn He Thr Lys He Asp Glu Asn Lys 

Asp Tyr Gly Lys Leu Val He Glu Pro Leu Glu Arg Gly Tyr Gly Thr 

20 25 30 

Ala Leu Gly Asn Ser Leu Arg Arg Val Leu Leu Ala Ser Leu Pro Gly 

35 40 45 

Ala Ala val Thr Ser He Asn He Asp Gly Val Leu His Glu Phe Asp 
50 55 60 

203 



wrt csnnmm PCT/US97/05306 
WO 97/37026 

Thr Val Pro Gly Val Arg Glu Asp Val Met Gin He He Leu Asn He 
65 70 75 80 

Lys Gly He Ala Val Lys Ser Tyr Val Glu Asp Glu Lys He He Glu 

85 90 95 

Leu Asp Val Glu Gly Pro Ala Glu Val Thr Ala Gly Asp He Leu Thr 

100 105 110 

Asp Ser Asp He Glu He Val Asn Pro Asp His Tyr Leu Phe Thr He 

115 120 125 

Gly Glu Gly Ser Ser Leu Lys Ala Thr Met Thr Val Asn Ser Gly Arg 

130 135 140 

Gly Tyr Val Pro Ala Asp Glu Asn Lys Lys Asp Asn Ala Pro Val Gly 
145 . 150 155 160 

Thr Leu Ala Val Asp Ser He Tyr Thr Pro Val Thr Lys Val Asn Tyr 

165 170 175 

Gin Val Glu Pro Ala Arg Val Gly Ser Asn Asp Gly Phe Asp Lys Leu 

180 185 190 

Thr Leu Glu He Leu Thr Asn Gly Thr He He Pro Glu Asp Ala Leu 

195 200 205 

Gly Leu Ser Ala Arg He Leu Thr Glu His Leu Asp Leu Phe Thr Asn 

210 215 220 

Leu Thr Glu He Ala Lys Ser Thr Glu Val Met Lys Glu Ala Asp Thr 
225 230 235 240 

Glu Ser Asp Asp Arg He Leu Asp Arg Thr He Glu Glu Leu Asp Leu 

245 250 255 

Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn Thr Val 

260 265 270 

His Asp Leu Thr Glu Lys Ser Glu Ala Glu Met Met Lys Val Arg Asn 

275 280 285 

Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Leu Lys Leu He Asp Leu 

290 295 300 

Gly Leu Gly Leu Lys Asp Lys 
305 310 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 
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Met Asp lie Glu Lys Gin Arg Gly lie Ser Val Thr Ser Ser Val Met 

15 10 15 

Gin Phe Asp Tyr Asp Gly Lys Arg Val Asn He Leu Asp Thr Pro Gly 

20 25 30 

His Glu Asp Phe Ser Glu Asp Thr Tyr Arg Thr Leu Met Ala Val Asp 

35 40 45 

Ala Ala Val Met Val Val Asp Ser Ala Lys Gly He Glu Ala Gin Thr 

50 55 60 

Lys Lys Leu Phe Glu Val Val Lys His Arg Gly He Pro Val Phe Thr 
65 70 75 80 

Phe Met Asn Lys Leu Asp Arg Asp Gly Arg Glu Pro Leu Asp Leu Leu 

65 90 95 

Gin Glu Leu Glu Glu He Leu Gly He Ala Ser Tyr Pro Met Asn Trp 

100 105 . no 

Pro He Gly Met Gly Lys Ala Phe Glu Gly Leu Tyr Asp Leu Tyr Asn 

115 120 125 

Gin Arg Leu Glu Leu Tyr Lys Gly Asp Glu Arg Phe Ala Ser Pro Arg 

130 135 140 

Arg Trp Arg Gin Thr Phe Trp 
145 150 

(2) INFORMATION FOR SEy ID NO:252: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 



Met Ala Val He Ala Glu Arg Lys Ala Tyr Tyr His Asp Pro Val Glu 

1 5 io 

Asp Ala He He Met Lys Arg Glu He Asp Glu Gly 
20 25 



15 



(2) INFORMATION FOR SEQ ID NO: 253 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

Met Asn Glu Thr Trp Ser Arg Thr Ala Leu Thr Leu Leu Lys His Gly 

15 10 15 

Ser Glu Val Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly 

20 25 30 

Gly His Val Val Thr Gly His lie Asp Gly Thr Gly Lys lie Ser Ser 

35 40 45 

He Lys Lys Asp Asp Asn Ala Val Trp Tyr Gin He Asn Thr Gin Lys 

50 55 60 

Glu He Leu Asp Leu He Val Glu Lys Gly Ser He Thr He Asp Gly 
65 70 75 80 

He Ser Leu Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser 

85 90 95 

Val He Pro His Thr Leu Lys Gin Thr He Leu Lys Ser Lys Gin Val 

100 105 110 

Gly Ser Thr Val Asn Leu Glu Asn Asp He Leu Gly Lys Tyr Val Gin 

115 120 125 

Lys Leu Met Asp Asn Ser Pro Lys Ser Glu He Ser Lys Glu Leu Leu 

130 135 140 

Tyr Gin Asn Gly Phe 
145 

(2) INFORMATION FOR SEQ ID NO:254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Met Asn Trp Ser Ala Leu Glu Gin Gin He Val Asp Glu Leu Lys He 

15 10 15 

Tyr He Ala Pro Lys He Phe Gly Gly Ser Ala Lys Phe Pro Val Gly 

20 25 30 
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Gly Glu Gly lie Ser Leu Pro Asn Asp Ala lie Arg Leu Lys Pro Tyr 

35 40 45 

Ala Phe Ser Gin Xaa Gly Xaa Asp Tyr Leu He Glu Ser Glu Val He 

50 55 60 

Tyr Pro Cys Ser Gin Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

Met Val Leu Phe Leu Val Pro Gin Leu Arg Asn Ala Tyr Gly Thr Ala 

15 10 15 

Ala He Gly He He -tys Gly Leu Tyr Trp Ala Val Ser Ser Asn Met 

20 25 30 

Thr Val Glu Ala Thr Gin Arg Leu Thr Gly Gly Gly Gly Phe Ala He 

35 40 45 

Gly His Gin Gin Gin Phe Ala He Trp Phe Val Asp Lys Val Ala Gly 

50 55 60 

Arg Phe Gly Lys Lys Glu Glu Ser Leu Asp Asn Leu Lys Leu Pro Lys 
65 7 ° 75 80 

Phe Leu Ser He Phe His Asp Thr Val Val Ala Ser Ala Thr Phe Asp 

85 90 95 

Ala Arg He Leu Arg Gly His Ser Phe Asn Leu Gly Ser Arg His Tyr 
100 105 no 

Val 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

Met Leu He Asn Ser Leu Asn Gin Ser Tyr He Ser Ala Lys Arg He 

15 10 15 

Glu Glu Val Phe Ala Glu Ser Pro Glu Asn He His Ser Glu Leu Glu 

20 25 30 

Gin Lys Gin Val Thr Ser Gly Arg Val Leu Gin Val Gin Glu Leu Thr 

35 40 45 

Phe Thr Tyr Pro Asp Ala Ala Gin Pro Ser Leu Arg Asp lie Ser Phe 

50 55 60 

Asp Met Thr Gin Gly Gin He Leu Gly He He Gly Gly Thr Gly Ser 
65 70 75 80 

Gly Lys Ser Ser Leu Val Gin Leu Leu Leu Gly Leu Tyr Pro Val Asp 

85 90 95 

Lys Gly Asn He Asp Leu Tyr Gin Asn Gly Arg Ser Pro Leu Asn Leu 

100 105 110 

Glu Gin Trp Arg Ser Trp He Ala Tyr Val Pro Gin Lys Val Lys Leu 

115 120 125 

Phe Lys Gly Thr He Arg Ser Asn Leu Thr Leu Gly Leu Asn Gin Glu 

130 135 140 

Val Ser Asp Gin Lys Leu Trp Gin Ala Leu Glu He Ala Gin Ala Lys 
145 150 155 160 

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu He Glu Ala 

165 170 175 

Gly Gly Arg Asn Phe Ser Gly Gly Gin Lys Gin Arg Leu Ser He Ala 

180 185 190 

Arg Ala Val Leu Arg Gin Ala Pro Phe lie lie Leu Asp Asp Ala Thr 

195 200 205 

Ser Ala Leu Asp Thr He Thr Glu Ser Lys Leu Leu Lys Ala lie Arg 

210 215 220 

Glu Asn Phe Pro Asn Thr Ser Leu He Leu lie Ser Gin Arg Thr Ser 
225 230 235 240 

Thr Leu Gin Met Ala Asp Gin He Leu Leu Leu Glu Lys Gly Glu Leu 

245 250 255 

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gin Val Tyr 

260 265 270 

Arg Glu He Asn Ala Ser Gin His Gly Lys Glu Asp 
275 280 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 257: 



Met Lys Asp Pro Glu Ser Arg Leu Tyr He Leu Leu Lys Asp Gly Gin 

15 10 15 

Val lie Gly Thr Cys Thr Val Asp Leu Ser Thr Asn Thr Asn Tyr Phe 

20 25 3 0 

Tyr Gly Leu Ala He Leu Glu Pro Glu Arg Gly Lys Gly Tyr Gly Ser 

35 40 45 

Tyr Leu Ala Lys Ser Leu Val Asn Gin Leu He Glu Gin Asn Asp Lys 

50 55 60 

Glu Phe Gin He Ala Val Glu Asp Ser Asn Val Gly Ala Lys Arg Leu 
65 70 75 so 

Tyr Glu Lys He Gly Phe Val Lys Gin Thr Gin Val Val Tyr Leu Asn 

85 90 
Glu Lys Gly Ala Arg Asp Ser Glu Val 
100 105 



95 



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
Met Phe Ala Leu Ser Ser Asn Lys Glu Leu Ala Glu Arg Val Ala Gin 

Glu He Gly He Glu Leu Gly Lys Ser Ser Val Arg Gin Phe Ser Asp 

20 25 30 

Gly Glu He Gin Val Asn He Glu Glu Ser He Arg Gly Lys His Val 

35 40 45 

Phe He Leu Gin Ser Thr Ser Ser Pro Val Asn Asp Asn Leu Leu Glu 
50 55 60 
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lie Leu lie Met Val Asp Ala Leu Lys Arg Ala Ser Ala Glu Ser Val 
65 70 75 80 

Asn Val Val Met Pro Tyr Tyr Gly Tyr Ala Arg Gin Asp Arg Lys Ala 

85 90 95 

Arg Ala Arg Glu Pro lie Thr Ser Lys Leu Val Ala Asn Met Leu Glu 

100 105 110 

Val Ala Gly Val Asp Arg Leu Leu Thr He Asp Leu His Ala Ala Gin 

115 120 125 

He Gin Gly Phe Phe Asp He Pro Val Asp His Leu Met Gly Ala Pro 

130 135 140 

Leu He Ala Asp Tyr Phe Glu Arg Pro Gly Met Val Gly Ser Asp Tyr 
145 150 155 160 

Val Val Val Ser Pro Asp His Gly Gly Val Thr Arg Ala Arg Lys Leu 

165 170 175 

Ala Glu Phe Leu Lys Thr Ser He Ala He He Glu Lys Arg Arg Ser 

180 185 190 

Val Asp Lys Met Asn Thr Ser Glu Val Met Asn Thr He Gly Lys Val 

195 200 205 

Glu Gly Asn His Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Met He Ser Phe Leu Leu Leu Leu Val Leu Val Trp Gly Phe Tyr He 

1 5 10 15 

Gly Tyr Arg Arg Gly Leu Leu Leu Gin Val Tyr Tyr Leu He Ser Ala 

20 25 30 

Met Ala Ser Ala Phe Met Ala Gly Gin Phe Tyr Lys Gly Leu Gly Glu 

35 40 45 

Gin Phe His Leu Leu Leu Pro Tyr Ala Asn Ser Gin Glu Gly Gin Gly 

50 55 60 

Thr Phe Phe Phe Pro Ser Asp Gin Leu Phe Gin Leu Asp Lys Val Phe 
65 70 75 80 
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Tyr Ala Gly He Gly Tyr Leu Leu Val Phe Gly He Val Tyr Ser He 

85 90 95 

Gly Arg Leu Leu Gly Leu Leu Leu His Leu He Pro Ser Lys Lys Leu 

100 105 110 

Gly Gly Lys Leu Phe Gin Val Ser Ala Gly He Leu Ser Met Leu Val 

115 120 125 

Thr Leu Phe Val Leu Gin Met Ala Leu Thr He Leu Ala Thr He Pro 

130 135 140 

Met Ala Val He Gin Asn Pro Leu Glu Lys Ser He Val Ala Lys His 
145 150 155 160 

He He Gin Ser He Pro lie Thr Thr Ser Trp Leu Lys Gin He Trp 

165 170 175 

Val Thr Asn Leu He Gly 
180 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 

Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu He Gly Glu Gly 

15 10 15 

Tyr Arg Thr Phe Pro Val Leu He Tyr Thr Gin Phe He Ser Glu Val 

20 25 30 

Gly Gly Asn Ser Ala Phe Ala He Met Ala He He He Ala Leu Ala 

35 40 45 

He Phe Leu lie Gin Lys His He Ala Asn Arg Tyr Ser Phe Ser Met 

50 55 60 

Asn Leu Leu His Pro lie Glu Pro Lys Lys Thr Thr Lys Gly Lys Met 
65 70 75 80 

Ala Ala lie Tyr Ala Thr Val Tyr Gly lie He Phe He Ser Val Leu 

85 90 95 

Pro Gin lie Tyr Leu lie Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met 

100 105 no 

Val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn 
115 120 125 
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Arg Met Gly Ser Ala lie Phe Asn Thr lie Arg lie Pro Leu lie Ala 

130 135 140 

Leu Val Leu Val Val Pro lie Tyr Asp lie Tyr Leu Leu Pro Ser Arg 

145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 261: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 



Met Val Pro Tyr He Val Pro Gly 

1 5 
Ser Ser Phe Asn Thr Gly Leu Phe 
20 

Gly Thr Ala Phe He Leu He Met 

35 40 
Tyr Thr He Arg Ser Ser Val Ala 

50 55 
He Glu Glu Ala Ala Gly Lys Leu 
65 70 



Thr Val Leu Gly He Ala Phe lie 

10 15 
Gly Ser Gly Phe Leu Met He Thr 
25 30 
Ser Leu Se . Val Arg Arg Leu Pro 
45 

Ser Leu Gin Gin He Ala Pro Ser 
60 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 



Met He Phe lie Pro Met Ala Ala Tyr Ser lie Ala Arg Asn Met Ser 

15 10 15 

Lys Arg Lys Ala Phe Thr He His Val Tyr Pro Leu Asn Pro Arg Asn 
20 25 30 

212 



WO 97/37026 PCT/US97/05306 

Leu Arg Thr Phe Pro Ser His His Asp Ser Asp Tyr Gly Tyr Asp Glu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 2 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 



Met Lys Phe Arg Lys Leu Ala Cys 

1 5 
Leu Gly Leu Ala Ala Cys Gly Asn 
20 

Lys Ser Gly Gly Asp Gly Ala Lys 

35 40 
Pro Val Phe Thr Gin Glu Lys Thr 

50 55 
Lys Ser He He Gin Ala Phe 
65 70 



Thr Val Leu Ala Gly Ala Ala Val 

10 is 
Ser Gly Gly Ser Lys Asp Ala Ala 
25 30 
Thr Glu He Thr Trp Trp Ala Phe 
45 

Gly Asp Gly Val Gly Thr Tyr Glu 
60 



(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 264: 

Met Asn Ser Gin Gin Lys Pro Gin Glu He Lys Val Glu Glu Pro Val 

1 5 io 15 

Glu Ser Lys Glu Glu Thr Val Asn Gin Pro Val Glu Gin Pro Lys Val 

20 25 30 

Glu Thr Pro Ala Val Glu Lys Gin Thr Glu Pro Thr Glu Glu Pro Lys 
35 40 45 
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Val Glu Val Thr Ser lie Pro Gin Thr Thr Arg Tyr Glu Glu Asp Leu 

50 55 60 

Thr Lys Glu His Gly Thr Arg Glu Val Val Lys Glu Gly Lys Asn Gly 
65 70 75 80 

Ser Arg Thr Val Thr Thr Pro Tyr lie Leu Asn Ala Thr Asp Gly Thr 

85 90 95 

Thr Thr Glu Gly Thr Ser Thr Thr Asp Glu Ala Glu Met Glu Lys Glu 

100 105 110 

Val Val Arg Val Gly Thr Lys Pro Lys Glu Lys Leu Ala Pro Val Leu 

115 120 125 

Ser Leu Thr Ser Val Thr Asp Asn Ala Met Leu Arg Ser Ala Arg Leu 

130 135 140 

Thr Tyr His Leu Glu Asn Thr Asp Ser Val Asp Val Lys Lys lie His 
145 150 155 160 

Ala Glu He Lys Asn Gly Asp Lys Val Val Lys Thr He Asp Leu Ser 

165 170 175 

Lys Glu Arg Leu Ser Asp Ala Val Asp Gly Leu Glu Leu Tyr Lys Asp 

180 185 190 

Tyr Lys He Val Thr Ser Met Thr Tyr Asp Arg Gly Asn Gly Glu Glu 

195 200 205 

Thr Ser Thr Leu Glu Glu Thr Pro Leu Arg Leu Asp Leu Lys Lys Val 

210 215 220 

Glu Leu Lys Asn He Gly Ser Thr Asn Leu Val Lys Val Asn Glu Asp 
225 230 235 240 

Gly Thr Glu Val Ala Ser Asp Phe Leu Thr Ser Lys Pro Val Asp Val 

245 250 255 

Gin Asn Tyr Tyr Leu Lys Val Thr Ser Arg Asp Asn Lys Val Val Ser 
260 265 270 

Pro Pro Ser 
275 



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 65: 
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Met Ala Val Met Lys He Glu Tyr Tyr Ser Gin Val Leu Asp Met Glu 

15 10 15 

Trp Gly Val Asn Val Leu Tyr Pro Asp Ala Asn Arg Val Glu Glu Pro 

20 25 30 

Glu Cys Glu Asp He Pro Val Leu Tyr Leu Leu His Gly Met Ser Gly 

35 40 45 

Asn His Asn Ser Trp Leu Lys Arg Thr Asn Val Glu Arg Leu Leu Arg 

50 55 60 

Gly Thr Asn Leu He Val Val Met Pro Asn Thr Ser Asn Gly Trp Tyr 
65 70 75 80 

Thr Asp Thr Gin Tyr Gly Phe Asp Tyr Tyr Thr Ala Leu Ala Glu Glu 

85 90 95 

Leu Pro Gin Val Pro Glu Thr Leu Leu Pro 
100 105 



(2) INFORMATION FOR SEQ ID NO: 2 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 



Met Arg Leu Leu Ala Met Lys Met 

1 5 
Lys He Thr Met Ser Leu Glu Asp 
20 

Ala Asp Phe Leu Val Pro Gin Glu 

35 40 
He Leu Gly 
50 



Lys Gin He Ser Asp Thr Thr Leu 

10 15 
Leu Met Asp Arg Gly Met Glu He 
25 30 
Lys Thr Glu Glu Phe Phe Tyr Ala 
45 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:267: 

Met Leu Pro He He Cys Cys Gly Glu Ser Leu Glu Thr Tyr Glu Ala 

15 10 15 

Gly Lys Ala Ala Glu Phe Val Gly Ala Gin Val Ser Ala Ala Leu Ala 

20 25 30 

Gly Leu Thr Ala Glu Gin Val Ala Ala Ser Val He Ala Tyr Glu Pro 

35 40 45 

He Trp Ala He Gly Thr Gly Lys Ser Ala Ser Gin Asp Asp Ala Gin 

50 55 60 

Lys Met Cys Lys Val Val Arg Asp Val Val Ala Ala Asp Phe Gly Gin 
65 70 75 80 

Glu Val Ala Asp Lys Val Arg Val Gin Tyr Gly Gly Ser Val Lys Pro 

85 90 95 

Glu Asn Val Ala Ser Tyr Met Ala Cys Pro Asp Val Asp Gly Ala Leu 

100 105 HO 

Val Gly Gly Ala Ser Leu Glu Ala Glu Ser Phe Leu Ala Leu Leu Asp 

115 120 125 

Phe Val Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 268: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 : 

Met Phe Val Glu Ala Cys Lys Ala Val Val Arg Ala Asn Glu Glu Tyr 

15 10 15 

Val Pro Pro Tyr Gly He Gly Gly Thr Leu Tyr Leu Arg Pro Leu Leu 

20 25 30 

He Gly Val Gly Asp He He Gly Val Lys Pro Ala Glu Glu Tyr He 

35 40 45 

Phe Thr He Phe Ala Met Pro Val Gly Asn Tyr Phe Lys Gly Gly Leu 
50 55 60 
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Val Pro Thr Asn Phe Leu He Gin Asp Glu Tyr Asp Arg Ala Ala Pro 
65 70 75 80 

Asn Gly Thr Gly Ala Ala Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu 

85 90 95 

Leu Pro Gly Lys Met Ala Lys Ser Arg His Phe Ser Asp Val He Tyr 

100 105 no 

Leu Asp Pro Ser Thr His Thr Lys He Glu Glu Val Gly Ser Ala Asn 

115 120 125 

Phe Phe Gly He Thr Ala Asp Asn Glu Phe Val Thr Pro Leu Ser Pro 

130 135 i 4 o 

Ser He Leu Pro Ser He Thr Lys Tyr Ser Leu Leu Tyr Leu Ala Glu 
145 150 155 160 

His Arg Leu Gly Leu Thr Pro lie Glu Gly Asp Val Pro He Asp Asn 

165 170 175 

Leu Asp Arg Phe Val Lys Ala Gly Ala Cys Gly Thr Ala Ala Val He 

180 185 190 

Ser Pro He Gly Gly He Gin His Gly Asp Asp Phe His Val He Leu 
1^5 200 205 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 



Met Val Met He Ser Met Leu Phe Tyr Ser Glu Thr Glu Val Gly Pro 

15 10 15 

Val Thr Arg Lys Leu Tyr Asn Glu Leu Thr Gly He Gin Phe Gly Asp 

20 25 30 

He Glu Ala Pro Glu Gly Trp He Val Lys Val Asp 
35 40 



(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

Met Thr Ala Ser Pro Leu Lys Lys Ser lie Lys Lys Lys Asn Arg Lys 

15 10 15 

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys lie lie Gly lie Asp 

20 25 30 

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser 

35 40 45 

Lys lie lie Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val 

50 55 60 

Ser Phe Lys Asn Gly Glu lie lie Val Gly Asp Ala Ala Lys Arg Gin 
65 70 75 80 

Ala Val Thr Asn Pro Asp Thr Val lie Ser lie Lys Ser Lys Met Gly 

85 90 95 

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gin Glu 

100 105 110 

lie S^r Ala Met lie Leu Gin Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr 

115 120 125 

Leu Gly Glu Lys Val Thr Lys Ala Val lie Thr Val Pro Ala Tyr Phe 

130 135 140 

Asn Asp Ala Gin Arg Gin Ala Thr Lys Asp Ala Gly Lys lie Ala Gly 
145 150 155 160 

Leu Glu Val Xaa Arg lie Val Asn Glu Pro Thr Ala Ala Ala Leu Ala 

165 170 175 

Tyr Gly Leu Asp Lys Thr Asp Lys Glu Glu Lys lie Leu Val Phe Asp 

180 185 190 

Leu Gly Gly Gly Thr Phe Asp Val Ser lie Leu Glu Leu Gly Asp Gly 

195 200 205 

Val Phe Asp Val Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp 

210 215 220 

Asp Phe Asp Gin Lys lie lie Asp His Leu Val Ala Glu Phe Lys Lys 
225 230 235 240 

Glu Asn Gly lie Arg Leu Val Tyr 
245 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Met Lys Leu He Val Asp Leu He Tyr Glu Gly Gly Phe Lys Lys Met 

15 10 15 

Arg Gin Ser He Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Val Ser Gly 

20 25 30 

Pro Arg Val He Thr Glu Gin Val Lys Glu Asn Met Lys Ala Val Leu 

35 40 45 

Ala Asp He Gin Asn Gly Lys Phe Ala Asn Asp Phe Val Asn Asp Tyr 

50 55 eo 

Lys Ala Gly Arg Pro Lys Leu Thr Ala Tyr Arg Glu Gin Ala Ala Asn 
65 ™ 75 80 

Leu Glu He Glu Lys Val Gly Ala Glu Leu Arg Lys Ala Met Pro Phe 

85 90 95 

Val Gly Lys Asn Asp Asp Asp Ala Phe Lys He Tyr Asn 
100 io5 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gin Leu Asp Lys Val 

15 10 15 

Arg Phe He Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr 

20 25 30 

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gin Leu Asp Thr 

35 40 45 

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu 

50 55 60 

Asn Gly Glu Val Lys He Phe Arg Ala Leu Asn Glu Ala Ser He Arg 
65 70 75 80 
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Arg S r Asp Arg Thr Met Val Ala 
85 

Phe Glu Arg Phe Arg Gly Asp Gly 
100 

Ser Thr Ala Tyr Asn 
115 
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Asp lie Val lie Asn Gly Val Pro 

90 95 
Leu Thr Val Ser Thr Pro Thr Gly 
105 110 



(2) INFORMATION FOR SEQ ID NO: 273: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 



Met Gly Ala Gin Leu Ala Arg Glu Phe Lys His Glu Ala Asp lie Val 

1 5 10 15 

Val Gly Val Pro Asn Ser Ser Leu Ser Ala Ala Met Gly Phe Ala Glu 

20 25 30 

Glu Ser Gly Leu Pro Asn Glu Met Gly Leu He Lys Asn Gin Tyr Thr 

35 40 45 

Gin Arg Thr Phe He Gin Pro Thr Gin Glu Leu Arg Glu Gin Gly Val 

50 55 60 

Arg Met Lys Leu Ser Ala Val Ser Gly Val Val Lys Gly Lys Arg Val 
65 70 75 80 

Val Met Val Asp Asp Ser He Val Arg Gly Thr Thr Ser Arg Arg He 

85 90 95 

Val Gin Leu Leu Lys Glu Ala Gly Ala Thr Glu Val His Val Ala He 

100 105 110 

Gly Ser Pro Ala Leu Ala Tyr Pro Cys Phe Tyr Gly He Asp He Gin 

115 120 125 

Thr Arg Gin Glu Leu He Ala Ala Asn His Thr Val Glu Glu Thr Arg 

130 135 140 

Gin He He Gly Ala Asp Ser Leu Thr Tyr Leu Ser He Asp Ser Leu 
145 150 155 160 

He Glu Ser He Gly He Glu Thr Asp Ala Pro Asn Gly Gly Leu Cys 

165 170 175 

Val Ala Tyr Phe Asp Gly Asp Tyr Pro Thr Pro Leu Tyr Asp Tyr Glu 
180 185 190 
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Glu Asp Tyr Arg Arg Ser Leu Glu Glu Lys Thr Ser Phe Tyr Lys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 



Met Lys He Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys He 

1 5 io is 

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu He Asn Gly 

20 25 30 

Ala Glu Val Arg Trp Val Glu He Pro Thr Val Phe Tyr Lys Ser Ser 

35 40 45 

Glu Val Leu Glu Ala Glu He Leu Arg », r Gin Pro Asp Ala Val Leu 

50 55 60 

Cys He Gly Gin Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val 
65 70 75 go 

Ala He Asn Gin Asp Asp Ala Arg He Pro Asp Asn Glu Gly Asn Gin 

85 go 95 

Pro He Asp Thr Pro He Arg He Asp Gly Ala Ser Ala Tyr Phe Ser 

100 105 no 

Ser Leu Pro He Lys Ala Met Val Gin Ala Asn Lys Lys Glu Gly Leu 

115 120 125 

Thr Gly Ser Leu Phe Pro He Arg Ala Gly Thr Phe Val Cys Ser His 

130 135 140 

Leu Met Tyr Gin Ala Leu Tyr Leu Val Glu Lys Lys Phe Pro Tyr Val 

I 45 150 ice 

x:,u 155 i6o 

Lys Ala Gly Phe Met His He Pro Tyr Met Met Glu Gin Val Val Asn 

165 170 175 

Arg Pro Thr Thr Pro Thr Met Ser Leu Val Asp He Arg Arg Gly He 

180 185 i 90 

Glu Ala Ala He Gly Ala Met He Glu His Gly Asp Gin Glu Leu Thr 

195 200 205 

Leu Val Gly Gly Glu He His 
210 215 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys 

15 10 15 

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr 

20 25 30 

Gly Leu Gin Ala Thr Thr Ser His Trp lie Thr Asn Arg Gin lie Glu 

35 40 45 

Ala Ala Arg lie Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val 

50 55 60 

Trp lie Lys He Phe Pro His Lys Ser Tyr Thr Ala Lys Ala He Gly 
65 70 75 80 

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro 

85 90 95 

Val Lys Arg Gly Lys Val Met Phe Glu He Ala Gly Val Ser Glu Glu 

100 105 110 

He Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 276: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

{C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Met Gly Trp Trp Arg Glu Thr He Asp He Val Lys Glu Asn Asp Pro 
15 10 15 
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Ala Ala Arg Thr Thr Leu Glu Val Leu Leu Thr Tyr Pro Gly Val Lys 

20 25 30 

Ala Leu Ala Ala His Arg Leu Ser His Phe Leu Trp Lys Tyr Asp Phe 

35 40 45 

Lys Leu Leu Ala Arg Met His Ser Gin Phe Trp Arg Phe Trp Thr Gin 

50 55 60 

He Glu He His Pro Gly Ala Gin He Asp Ser Gly Val Phe He Asp 
65 70 75 80 

His Gly Ser Gly Leu Val lie Gly Glu Thr Ala He Val Glu Lys Gly 

85 90 95 

Val Leu Leu Tyr His Gly Val Thr Leu Gly Gly Thr Gly Lys Asp Cys 

100 105 no 

Gly Lys Arg His Pro Thr Val Arg Lys Gly Ala Leu He Ser Ala His 

115 120 125 

Ala Gin Val He Gly Pro Val Glu He Gly Glu Asn Ala Lys Val Gly 

130 135 140 

Ala Ala Ala Val Val Val Ala Asp Val Pro Ser Asp Val Thr Val Val 
145 150 155 160 

Gly He Pro Ala Lys He Val Arg Leu His Gly Lys Lys Asp Glu Pro 

165 170 175 

Val He His Glu Val Glu Glu Lys Arg Glu Tyr Tyr Val Asn Lys Leu 

I 80 185 190 

Glu Gin Ala Lys Asp Ala Ser His Arg Ser Ser Gly Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

Met Leu Phe Tyr Leu Leu Arg Asp Gly Lys Gly Leu Arg Asn Tyz Leu 

15 10 15 

Thr Gin Phe He Pro Ser Lys Leu Lys Glu Pro Val Gly Gin Val Leu 

20 25 30 

Ser Asp Val Asn Gin Gin Leu Ser Asn Tyr Val Arg Gly Gin Val Thr 
35 40 45 
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Val Ala lie He Val Ala Val Met Phe lie He Phe Phe Lys He He 

50 55 60 

Gly Leu Arg Tyr Ala Val Thr Leu Gly Val Thr Ala Gly He Leu Asn 
65 70 75 80 

Leu Val Pro Tyr Leu Gly Ser Phe Leu Ala Met Leu Pro Ala Leu Val 

85 90 95 

Leu Gly Leu He Ala Gly Pro Val Met Leu Leu Lys Val Val He Val 

100 105 no 

Phe He Val Glu Gin Thr He Glu Gly Arg Phe Val Ser Pro Leu He 

115 120 125 

Leu Gly Ser Gin Leu Asn He His Pro He Asn Val Leu Phe Val Leu 

130 135 140 

Leu Thr Ser Gly Ser Met Phe Gly He Trp Gly Val Leu Leu Gly He 
145 150 155 160 

Pro Val Tyr Ala Ser Ala Lys Val Val He Ser Ala He Phe Glu Trp 

165 170 175 

Tyr Lys Val Val Ser Gly Leu Tyr Glu Leu Glu Gly Glu Glu Val Lys 
180 185 190 

Ser Glu Gin 
195 



(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 



Met Thr Lys Gin Val Pro Lys Phe 

1 5 
Thr Cys Lys Trp Leu Leu Tyr Asn 
20 

Thr Val Val Asn His Ser Val Arg 

35 40 
Thr Asn Thr He Glu Glu Thr Gly 
50 55 



Thr Lys Asp Thr Ala Gin Leu Tyr 

10 15 
Lys Val Thr Lys Met Tyr Asp His 
25 30 
Glu Tyr He Thr Asp Ser He Ser 
45 



(2) INFORMATION FOR SEQ ID NO: 279: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

Met Lys Lys Arg Ala lie Val Ala Val lie Val Leu Leu Leu lie Gly 

15 10 15 

Leu Asp Gin Leu Val Lys Ser Tyr He Val Gin Gin He Pro Leu Gly 

20 25 30 

Glu Val Arg Ser Trp He Pro Asn Phe Val Ser Leu Thr Tyr Leu Gin 

35 40 45 

Asn Arg Gly Ala Ala Phe Ser He Leu Gin Asp Gin Gin Leu Leu Phe 

50 55 60 

Ala Val He Thr Leu Val Val Val He Gly Ala He Trp Tyr Leu His 
65 70 75 80 

Lys His Met Glu Asp Ser Phe Trp Met Val Leu Gly Leu Thr Leu He 

85 90 95 

He Ala Gly Gly Leu Gly Asn Phe He Asp Arg Val Ser Gin Gly Phe 

100 105 HO 

Val Val Asp Met Phe His Leu Asp Phe He Asn Phe Ala He Phe Asn 

115 120 125 

Val Ala Asp Asn Tyr Leu Thr Val Gly Val lie He Leu Leu He Ala 

130 135 140 

Met Leu Lys Glu Glu He Asn Gly Asn 
145 150 

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: v 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
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Met Gly Ala Tyr Tyr Leu Val Met Gin Ser Leu Ser Tyr Leu Glu Tyr 

15 10 15 

Glu Gin Gly lie Gin Ser Thr Thr Val Arg His Leu He Leu Val Phe 

20 25 30 

Tyr Leu Leu Phe Phe Met Gly Gly He Lys Lys Leu Asp Thr Tyr Leu 

35 40 45 

Lys Glu Lys Leu Gin Glu Glu Leu Asn Gin Glu Gin Thr Leu Arg Tyr 

50 55 60 

Arg Asp Met Glu Arg Tyr Ser Arg His He Glu Glu Leu Tyr Lys Glu 
65 70 75 so 

He Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Thr Tyr Val 

85 90 95 

Trp Ala Leu Lys Arg Arg He Trp Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

Met His Val Arg Met He Pro Lys Ser Thr Pro Asp Thr Lys Phe Ala 

15 10 15 

Asp Val Ala Thr His Gin Pro Glu Tyr $er Arg Asp Asn Val Ala Gly 

20 25 30 

Thr He Val Gly Phe Trp Thr Pro Glu He Phe His Gly Val Ser Val 

35 40 45 

Ala Gly Tyr His Leu His Phe He Ser Asp Asp Leu Thr Phe Gly Gly 

50 55 60 

His Val Met Asp Phe Val He Lys Glu Gly He He Glu Val Gly Ala 
65 70 75 80 

Val Asp Gin Leu Asp Gin Arg Phe Pro Val Gin Asp Arg Gin Tyr Leu 

85 90 95 

Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp He Glu Lys Ala 
100 105 HO 

Glu 
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(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: ' 

Met Ala Val Ala Lys Gly Lys Leu Thr lie He Ala His Val Ala Cys 

Asn Asn Thr Lys Asp Ser Met Glu Leu Ala Arg His Ala Glu Ser Leu 
20 25 



30 

Gly Val Asp Ala He Ala Thr Asp Ser Thr Asn Leu Phe Pro Leu Ala 

35 40 45 

Arg lie Thr Gin Leu Pro Asn Thr Gly Thr lie Ser Val Leu Gin Leu 

50 55 60 

Gin Thr Gin Thr Thr 
55 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 83: 

Met Gly Glu Thr Gin lie He Asp Gly Leu Asp Pro Glu Tyr Lys Lys 

Arg Phe Met His His Tyr Asn Phe Pro Gin Tyr Ser Val Gly gL Thr 

20 25 ^ 

Gly Arg Tyr Gly Ala Pro Gly Arg Arg Glu lie Gly His Gly Ala Leu 

35 4 ° 45 

Gly Glu Arg Ala Leu Ala Gin Val Leu Pro Ser Leu Glu Glu Phe Pro 



55 



60 
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Tyr Ala lie Arg Leu Val Ala Glu Val Leu Glu Ser Asn Gly Ser Ser 
65 70 75 80 

Ser Gin Ala Ser He Cys Ala Gly Thr Leu Ala Leu Met Ala Gly Gly 

85 90 95 

Val Pro He Lys Ala Pro Val Ala Gly He Ala Met Gly Leu He Ser 

100 105 110 

Asp Gly Asn Asn Tyr Thr Val Leu Thr Asp He Gin Gly Leu Glu Asp 

115 120 125 

His Phe Gly Asp Met Asp Phe Lys Val Ala Gly Thr Arg Asp Gly He 

130 135 140 

Thr Ala Leu Gin Met Asp lie Lys He Gin Gly He Thr Ala Glu He 
145 150 155 160 

Leu Thr Glu Ala Leu Ala Gin Ala Lys Lys Ala Arg Phe Glu He Leu 

165 170 175 

Asp Val He Glu Ala Thr He Pro Glu Val Arg Pro Glu Leu Ala Pro 

180 185 190 

Thr Ala Pro Lys He Asp Thr He Lys lie Asp Val Asp Lys He Lys 

195 200 205 

lie Val lie Gly Lys Gly Gly Glu Thr lie Asp Lys lie lie Ala Glu 

210 215 220 

Thr Gly Val Lys He Asp lie Asp Glu Glu Xaa Asn Val Phe Tyr Leu 
225 230 235 240 

Leu Leu Val Asp Gin Asn Ala He Asn Pro Cys Pro Lys Lys Leu Leu 

245 250 255 

Leu Val Trp Phe Arg Glu Pro Lys Trp Met Lys Phe Thr Val Leu Asn 

260 265 270 

Arg Ser Tyr Arg Glu Phe Gly Ala Phe Val Thr Leu 
275 280 

(2) INFORMATION FOR SEQ ID NO:284: 

<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 84: 

Met He Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
15 10 15 
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Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 

20 25 30 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 

35 40 45 

He Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 

50 55 60 

Ala Lys Lys Gly Ser Leu He Gly Val Thr Gly Arg He Gin Thr Arg 
65 7 0 75 eo 

Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 

85 90 95 

Ala Glu Asn Phe Gin Met Leu Glu Ser Arg Ser Val Arg Glu Gly His 

100 105 no 

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr 

115 ^ 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 

"° 135 i 4 o 

Asn Pro Leu Asp He Ser Xaa Asp Asp Leu Pro Phe 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 



Met Asp Leu val Arg He Leu Lys Glu Pro Arg Asn Ala Leu Val Lys 

1 5 io 15 

Gin Tyr Gin Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe Asp 
20 25 



30 



Asp Glu Ala Leu Gin Glu lie Ala Asn Lys Ala He Glu Arg Lys Thr 

35 40 45 

Gly Ala Arg Gly Leu Arg Ser He He Glu Glu Thr Met Leu Asp Val 



50 



55 60 
Met Phe Glu Val Pro Ser Gin Glu Asn Val Lys Leu Val Arg He Thr 
65 70 75 80 

Lys Glu Thr Val Asp Gly Thr Asp Lys Pro lie Leu Glu Thr Ala 
85 90 95 
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(2) INFORMATION FOR SEQ ID NO: 286: 



PCIYUS97/05306 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 104 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 

Met Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn lie Leu Leu 

15 10 15 

Lys Leu Leu Gin Val Ala Asp Phe Asn lie Glu Arg Ala Glu Arg Gly 

20 25 30 

lie He Tyr Val Asp Glu He Asp Lys He Ala Lys Lys Ser Glu Asn 

35 40 45 

Val Ser He Thr Arg Asp Val Ser Gly Glu Gly Val Gin Gin Ala Leu 

50 55 60 

Leu Lys He He Glu Gly Thr Val Ale Ser Val Pro Pro Gin Gly Gly 
65 70 75 80 

Arg Lys His Pro Gin Gin Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr 

85 90 95 

Pro Leu His Arg Gly Trp Cys Phe 
100 

(2) INFORMATION FOR SEQ ID NO: 2 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Met Trp Pro Arg Glu Ser Arg Ser Cys Arg Leu Thr Lys Val Lys He 
15 10 15 
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Cys Gly Leu Ser Thr Lys Glu Ala Val Glu Thr Ala Val Ser Ala Gly 

20 25 30 

Ala Asp Tyr lie Gly Phe Val Phe Ala Pro Ser Lys Arg Gin Val Thr 

35 40 45 

Leu Glu Glu Ala Ala Glu Leu Ala Lys Leu lie Pro Ala Asp Val Lys 

50 55 60 

Lys Val Gly Val Phe Val Ser Pro Ser Arg Val Glu Leu Leu Glu Ala 
65 70 75 80 

lie Asp Lys Val Gly Leu Asp Leu Val Gin Val His Gly Gin Val Ala 

85 90 95 

Asp Asp Leu Phe Glu Asn Leu Pro Cys Ala Ser lie Gin Ala Val Gin 

100 105 110 

val Asp Gly Asn Gly His Val Pro Asn Ser Gin Ala Asp Tyr Leu Leu 

115 120 125 

Phe Asp Ala Pro Val Ala Gly Ser Gly Gin Ser Phe Asp Trp Gly Gin 

I 30 135 140 

Leu Asp Thr Thr Gly Leu Ala Gin Pro Phe Phe lie Ala Gly Gly Leu 
145 150 155 160 

Asn Glu Asp Asn Val Val Lys Ala lie Gin His Phe Thr Pro Tyr Ala 

165 170 175 

Val Asp Val Ser Ser Gly Val Glu Thr Asp Gly Gin Lys Asp His Glu 

180 i 85 lgo 

Lys lie Arg Arg Phe lie Glu Arg Val Lys His Gly He Ser Gly Thr 
195 200 205 

Lys 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 
Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser 
Arg Arg Gly Lys Lys Gin Lys Gly Glu He Leu Met Ala Val lie Ser 



25 30 
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Met Lys Gin Leu Leu Glu Ala Gly 

35 40 
Arg Trp Asn Pro Lys Met Ala Lys 

50 55 
lie His Val lie Asp Leu Gin Gin 
65 70 
Tyr Xaa Phe Met Arg Asp Ala Ala 
85 

Val Gly Thr Lys Lys Thr Ser Ser 
100 
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Val His Phe Gly His Gin Thr Arg 
45 

Tyr He Phe Thr Glu Arg Asn Gly 
60 

Thr Val Lys Tyr Ala Asp Gin Ala 

75 80 
Ala Asn Asp Ala Val Val Leu Phe 
90 95 



(2) INFORMATION FOR SEQ ID NO: 289: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 



(Xi) SEQUENCE D T -,CRIPTION: SEQ ID NO:289: 



Met Val Asp Thr Asn Thr Asp Pro Asp Asp He Asp Val He He Pro 

15 10 15 

Ala Asn Asp Asp Ala He Arg Ala Val Lys Leu He Thr Ala Lys Leu 

20 25 30 

Ala Asp Ala He He Glu Gly Arg Gin Gly Glu Asp Ala Val Ala Val 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 290: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

( C } STRANDEDNESS : S ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



Met Glu Glu Leu Gly Gin Ser Tyr Gly Tyr Leu Leu Tyr Arg Thr Glu 
15 10 15 
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Thr Asn Trp Asp Ala 
20 

Asp Arg Ala Gin Leu 
35 

Gin Thr Glu lie Gly 
50 

Leu Ser Arg Leu Asp 
65 

Gly His Lys Phe Leu 
85 

Val Cys Lys Asp Leu 
100 

Pro Leu Asp Asn Pro 
115 

Gly Gin Pro Ala Phe 
130 



Glu Glu Glu Arg Leu Arg 
25 

Tyr Val Asp Gly Gin Trp 
40 

Glu Asp lie Phe Tyr Gin 
55 

He Leu He Glu Asn Met 
70 75 
Ala Asp Thr Gin Arg Lys 
90 

His Phe Leu Leu Asn Trp 
105 

Glu Lys He Asp Phe Ser 
120 

Tyr Ala Tyr Asp Phe Thr 
135 
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He He Asp Gly Arg 
30 

Val Lys Thr Gin Tyr 
45 

Gly Lys Lys Lys Gly 
60 

Gly Arg Val Asn Tyr 
80 

Gly He Arg Thr Gly 
95 

Lys His Tyr Pro Leu 
110 

Lys Gly Trp Thr Gin 

125 
Val Glu 
140 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Met Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gin Gly Leu Pro Gly 

1 5 io 15 

Leu Ala Tyr Tyr Val He Glu Val Ala His Lys Glu Glu Leu Leu Thr 

20 25 30 

He Ala Gin Arg Ala Gin Glu Val Asp Val Pro He Lys Trp Met Thr 

35 40 45 

Ser Ser Gin Leu Glu He Thr Asp Ser Asp Gly He Val Thr Cys He 

50 55 60 

Arg Leu Ala Arg 
65 



(2) INFORMATION FOR SEQ ID NO:292: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 



Met Asn Lys Lys Gin Trp Leu Gly 

1 5 
Val Gly Leu Ala Ala Cys Gly Asn 
20 

Ser Ser Asp Val Lys Thr Lys Ala 

35 40 
Val Asp Asp Lys Ser Phe Asn Gin 
50 55 

Gly 
65 



Leu Gly Leu Val Ala Val Ala Ala 

10 15 
Arg Ser Ser Arg Asn Ala Ala Ser 
25 30 
Ala He Val Thr Asp Thr Gly Gly 
45 

Ser Ala Trp Glu Val Ala Gly Leu 
60 



(2) INFORMATION FOR SEQ ID NO: 293: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 



Met Ser Gin Trp Asp Arg Lys Leu Asp Ala Arg Leu Ala Gin Ala Val 

15 10 15 

Val Ser He Asn Ala Phe Lys Gly Val Glu Phe Gly Leu Gly Phe Glu 

20 25 30 

Ala Gly Tyr Arg Lys Gly Ser Gin Val Met Asp Glu He Leu Trp Ser 

35 40 45 

Lys Glu Asp Gly Tyr Thr Arg Arg Thr Asn Asn Leu Gly Gly Phe Glu 

50 55 60 

Gly Gly Met Thr Asn Gly Gin Pro He Val Val Arg Gly Val Met Lys 
65 70 75 80 

Pro He Pro Thr Leu Tyr Lys Pro Leu Met Ser Val Asp He Glu Thr 
85 90 95 
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His Glu Pro Tyr Lys Ala Thr Val Glu Arg Ser Asp Pro Thr Ala Leu 

100 105 no 

Pro Ala Ala Gly Met Val Met Glu Ala Val Val Ala Thr Val Leu Ala 

"5 120 125 

Gin Glu He Leu Glu Lys Ph Ser Ser Asp Asn Leu Glu Glu Leu Lys 

130 135 140 

Glu Ala Val Ala Lys His Arg Asp Tyr Thr Lys Asn Tyr 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 56 amino acids 
(B} TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 94: 

Met Glu Asp Ser Asn Ser Phe Met Leu Ser His Gin Met Gl* Thr He 

15 10 15 

Leu Lys Glu Ala Gly Phe Thr Lys Ala Val Ser Tyr Phe He Leu Glu 

20 25 30 

Leu Lys Asp Pro Ser Gin Thr Lys Val Val Thr Glu Glu Leu Gin Lys 

35 40 45 

Asn Lys Lys Tyr Thr Val Leu Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 
Met Gly Lys Lys Arg Trp Ala Arg Asn Gly Phe Glu Ser Asn Asp Ala 
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Ser Tyr Ala Gin Val Val Ser Leu 
20 

Ser Asn Asn Glu Thr Asp Lys Val 

35 40 
Thr Asn Glu Gin Gly Leu Thr lie 
50 55 

Glu 
65 
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Tyr Asp Asp Thr Ser He Ser Val 

25 30 

Leu Ala Gly Ser Leu Tyr Thr Asp 
45 

Pro Ser Ser Phe Thr Lys Lys Leu 
60 



(2) INFORMATION FOR SEQ ID NO; 296: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

Met Ala Met Glu Ser Gly Ala Thr Ala Val Val Ala Glu Arg Gly Gin 

15 10 15 

Glu Arg He Thr Lys Val Arg Glu He Leu Gly Gly Gly Ala Asp Ala 

20 25 30 

Ala Leu Glu Cys Val Gly Thr Glu Ala Ala He Glu Gin Ala Leu Gly 

35 40 45 

Val Leu His Asn Gly Gly Arg Met Gly Phe Val Gly Val Pro His Tyr 

50 55 60 

Asn Asn Arg Ala Leu Gly Ser Thr Phe Met Gin Asn He Ser Val Ala 
65 70 75 80 

Gly Gly Ala Ala Ser Ala Thr Thr Tyr Asp Lys Gin Phe Leu Leu Lys 

85 90 95 

Ala Val Leu Asp Gly Asp He Asn Pro Gly Arg Val Phe Thr Ser Ser 

100 105 110 

Tyr Lys Leu Glu Asp lie Asp Gin Ala Tyr Lys Asp Met Asp Glu Arg 

115 120 125 

Lys Thr He Lys Ser Met He Val lie Glu 
130 135 



(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:297: 

Met Lys Gin Val Val Asp Tyr Phe Leu Ser Gin Gly Met Asp Arg lie 

15 10 15 

Gly lie Leu Thr Gly Leu Glu Glu Thr Thr Asp Gin Glu Glu He He 

20 25 30 

Gin Asp Lys Arg Leu Glu Asn Phe Lys Asn Tyr Ser Gin Ala Arg Gly 

35 40 45 

He Tyr His Asp Glu Leu Val Phe Gin Gly Arg Phe Thr Ala Gin Ser 

50 55 60 

Gly Tyr Asp Leu Met Lys Glu Ala lie Gin Ser Leu Gly Asp Gin Leu 
65 7 0 75 60 

Pro Pro Ala Phe Phe Ala Ala Ser Asp Ser Leu Ala He Gly Ala Leu 

85 so 95 

Arg Ala Leu Gin Glu Ala Gly He Ser Leu Pro Asp Arg Val Ser Leu 

100 105 110 

He Phe Leu Leu Thr Thr Leu Ser Leu Thr Lys Gin Val Tyr Pro Pro 
115 120 125 

Leu Ser 
130 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: aroino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

Met Ala Lys Val Thr lie Met Leu Ala Cys Ala Ala Gly Met Ser Thr 

Ser Leu Leu Val Thr Lys Met Gin Lys Ala Ala Glu Asp Lys Gly Leu 
20 25 3 0 
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Asp Ala Glu He Phe Ala Val Pro Ala Pro Glu Ala Glu Glu He Val 

35 40 45 

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gin Val Arg Tyr 

50 55 60 

Leu Leu Gly Asp Phe Gin Glu Lys Leu Lys Asp Arg Gin He Pro Val 
65 70 75 80 

Ala Val He Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val 

85 90 95 

Leu Asp Leu Ala Glu Ser Leu Leu Asp 



(2) INFORMATION FOR SEQ ID NO: 299: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala 

15 10 15 

Glu Gin Leu Leu Glu Ala Gly Ala Asp Val He Leu Val Pro Ala Val 

20 25 30 

Gly Thr Val Pro Ala Phe His Asp Gin Glu Leu Arg Glu Val val Asp 

35 40 45 

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala He Gly Thr Ser 

50 55 60 

Gin Glu Thr Ser Asp Thr Asp Thr lie Lys Glu He Ala Leu Arg Asn 
65 70 75 80 

Lys He Cys Gly Val Asp lie Gin His lie Gly Asp Ala Gly Tyr Gly 

85 90 95 

Gly Leu Ala Thr Val Asp Asn He Tyr Ala Leu Ser Lys Ala lie Arg 

100 105 110 

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg 



100 



105 



115 



120 



125 



(2) INFORMATION FOR SEQ ID 



NO:300: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 6 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 : 



Met Ser Ala He Asn Leu Ala Ser Val Pro Lys Glu Ser Leu Thr Gin 

15 10 is 

Val Leu Pro Arg Asp Leu His Ala Glu Tyr Phe Ala Val Leu Ala Ser 

20 25 30 

He Ala Thr Ser He Glu Arg Met Ala Thr Glu He Arg Gly Leu Gin 

35 40 45 

Lys Ser Glu Gin Arg Glu Val Glu Glu Phe Phe Ala Lys Gly Gin Lys 

50 55 60 

Gly Ser Ser Ala Met Pro His Lys Arg Asn Pro He Gly Ser Glu Asn 
65 70 75 80 

Met Thr Gly Leu Ala Arg Val He Arg Gly His Met He Thr Ala Tyr 

85 90 95 

Glu Asn Val Ala Leu Trp His Glu Arg Asp He Ser His Ser Ser Ala 

100 105 HO 

Glu Arg He He Thr Pro Asp Tnr Thr He Leu He Asp Tyr Met Leu 

115 120 125 

Asn Arg Phe Gly Asn He Val Lys Asn Leu Thr Val Phe Pro Glu Asn 

130 135 140 

Met He Arg Asn Met Asn Ser Thr Phe Gly Leu He Phe Ser Gin Arg 
145 150 155 160 

Ala Met Leu Thr Leu He Glu Lys Gly Met Thr Arg Glu Gin Ala Tyr 

165 170 175 

Asp Leu Val Gin Pro Lys Thr Ala Tyr Ser Trp Asp Asn Gin Val Asp 

180 185 190 

Phe Lys Pro Leu Leu Glu Ala Asp Ser Glu Val Thr Ser Arg Leu Thr 

195 200 205 

Gin Glu Glu He Asp Glu He Phe Asn Pro Val Tyr Tyr Thr Lys Arg 

210 215 220 

Val Asp Asp He Phe Glu Arg Leu Gly Leu Gly Asp 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 3 01: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:301: 

Met Leu Asn Leu Thr His Val Thr Leu Lys Thr Arg Gin Val He Leu 

15 10 15 

Gin Asp Ala Asp Phe Thr Phe Lys Lys Gly Arg He Tyr Gly Leu Leu 

20 25 30 

Ala He Asn Gly Ser Gly Lys Thr Thr Leu Phe Arg Ala Met Ser Lys 

35 40 45 

Leu Leu Pro Leu Ser Ser Gly His He Ala Val Pro Pro Ser Leu Phe 

50 55 60 

Tyr Tyr Glu Ser Val Glu Trp Leu Asp Gly Asn Leu Ser Gly Met Asp 
65 70 75 80 

Tyr Leu Arg Leu He Lys Lys His Leu Glu Val Arg Pro Lys Leu Glu 
85 90 95 

Arg 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Thr Leu Gly Gin Pro Asn Glu Leu Ala Ala Glu Phe Leu Asn 

15 10 15 

Phe Val Leu Ser Asp Glu Thr Gin Glu Gly He Val Lys Gly Leu Lys 

20 25 30 

Tyr He Pro He Lys Glu Met Lys Val Glu Lys Asp Ala Ala Gly Thr 

35 40 45 

Val Thr Val Leu Glu Gly Arg Gin 
50 55 

(2) INFORMATION FOR SEQ ID NO: 303: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 03: 

Met Asn Gin Glu Glu Leu Ala Lys Lys Met Leu Leu Pro Ser Lys Asn 

1 5 • 10 15 

Ser Arg Leu Glu Lys Leu Gly Lys Gly Leu Thr Phe Ala Cys Leu Ser 

20 25 30 

Leu He Val He Leu Val Ala Met He Leu Val Phe Val Ala Gin Lys 

35 40 45 

Gly Leu Ser Thr Phe Phe Val Asn Gly Val Asn He Phe Asp Phe Leu 

50 55 60 

Leu Gly Gly Thr Trp Asn Pro Ser Ser Lys Glu Phe Gly Ala Leu Pro 
65 ^ ™ 75 80 

Met He Leu Gly er Phe He Val Thr lie" Leu Ser Ala Leu He Ala 

85 90 95 

Thr Pro Phe Ala lie Gly Ala Ala Val Phe Met Thr Glu Val Ser Pro 

100 105 no 

Lys Gly Ala Lys lie Leu Gin Pro Ala He Glu Leu Leu Val Gly He 

115 120 125 

Pro Ser Val Val Tyr Gly Phe He Gly Leu Gin Val Val Val Pro Phe 

130 135. 140 

Val Arg Ser Val Phe Gly Gly Thr Gly Phe Gly He Leu Ser Gly lie 
145 150 155 I60 

Ser Val Leu Phe Val Met He Leu Pro Thr Val Thr Phe Met Thr Thr 

l fi5 170 175 

Asp Ser Leu Arg Ala Val Pro Xaa Leu Leu Ser 
180 i 8 5 

(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

Met Thr Glu lie Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gin Glu 

15 10 15 

Arg He Leu Glu Asp He Asn Leu Gin Val Thr Ser Gly Glu Val Val 

20 25 30 

Ser He Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu 

35 40 45 

He Ala Gly He Leu Glu Val Gin Ser Gly Arg He Val Leu Asp Gly 

50 55 60 

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gin Lys Asp Leu 
65 70 75 80 

Leu Leu Glu His Lys Thr Val Leu Gly Asn He He Leu Pro Leu Leu 

85 90 95 

He Gin Lys Val Asp Lys Ala Glu Ala He Ser Arg Ala Asp Lys He 

100 105 110 

Leu Ala Thr Phe Gin Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu 

115 120 125 

Leu Ser Gly Gly Met Arg Gin Arg Val Ala Leu Leu Arg Thr Tyr Leu 

130 135 140 

Phe Gly His Lys Leu Phe Leu Leu Asp Glu Ala Phe Ser Ala Leu Asp 
145 150 155 160 

Glu Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu He His Lys 

165 170 175 

Gin Leu Gin Leu Thr Thr Leu He Xaa Thr His Ser He Glu Glu Ala 

180 185 190 

Leu Xaa Leu Ser Asp Arg He Tyr He Leu Xaa Asn Xaa Pro Gly Gin 

195 200 205 

He Val Ser Glu He Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu 

210 215 220 

Val Xaa Lys He Ala Xaa Lys Arg Gin He Leu Ala Glu Leu Gly Leu 
225 230 235 240 

Asp Lys 



(2) INFORMATION FOR SEQ ID NO: 305: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: 

Met Glu His Pro Xaa Glu Ala Ala Asp lie Leu He Lys Asn Ala Pro 

15 10 15 

Glu Leu Lys Glu Lys Arg Asp Phe Val He Glu Ser Gin Lys Tyr Leu 

20 25 30 

Ser Lys Glu Tyr Ala Ser Asp Lys Glu Lys Trp Gly Gin Phe Asp Ala 

35 40 45 

Ala Arg Trp Asn Ala Phe Tyr Lys Trp Asp Lys Glu Asn Gly He Leu 

50 55 60 

Lys Glu Asp Leu Thr Asp Lys Gly Phe Thr Asn Glu Phe Val Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306: 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 

15 10 15 

Leu Asn He Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp 

20 25 30 

Asn Gly Asn He Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Thr 

35 40 45 

His Val Thr Val Asp He Val Ser Leu His Asp Asp Phe Asp Glu Asn 

50 55 60 

His Tyr Asp He Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 70 75 80 

He He Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser He Asp Asn Tyr 

85 90 95 

He Gin Asn Asp Gly Val Val Leu Ala He Cys Gly Gly Phe Gin Leu 

100 105 no 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg He Glu Gly Leu 
"5 120 12 5 
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Gly Val Met Gly His Tyr Thr Leu Asn Gin Thr Asn Asn Arg Phe lie 

130 135 140 

Gly Asp lie Lys lie His Asn Glu Arg Phe Arg 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 307: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 : 



Met His Gin His Ser Pro His Thr Ala Ser Tyr Thr Ser Ser Arg Asp 

15 10 15 

Ala Met Ser Lys Tyr Pro Glu Arg Cys Thr Thr Val Gly Leu Arg Leu 

20 25 30 

Asn Glu Glu Ser Asp Phe Glu Leu Tyr Ala Pro Tyr Gly Lf o Glu Asp 

35 40 45 

lie Leu Asn Phe Lys Phe Val Gin Leu Leu lie Ser 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 308 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 



Met Phe Glu lie Phe Asn Met Gly Val Gly Leu Met Leu Ala Val Ser 

15 10 15 

Pro Glu Asn Val Glu Arg Val Lys Glu Leu Leu Asp Glu Ala Val Tyr 

20 25 30 

Glu lie Gly Arg He Val Lys Lys Glu Asn Glu Ser Val He He Lys 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 150 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 



Met Met Val Lys Lys His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys 

Met Arg Arg lie Leu Leu Leu Val Ala Val Leu Leu Met He Pro Ser 

20 25 30 

Phe He Ser Ala Thr Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser 

35 40 45 

.Leu Lys Lys Phe lie Ser Glu Gin Phe Gin Gly His Asn He Leu Lys 

50 55 60 

Lys Thr Tyr Ser Lys Lys Thr His Thr Leu Lys Leu Thr He Ser Gly 
65 70 75 so 

Asn Tyr Leu Thr Glu Glu Glu Leu Asp Met He Ser Ser Lys Arg Gly 

85 90 95 

Asp Tyr Gly Leu Ser Asp Val Ser Val Gin Val Ser Gin Leu Ser Asp 

100 105 110 

Ser Glu Gin Leu Ser Lys Glu Glu Leu Val Glu Tyr Phe Phe Gin Tyr 

115 120 125 

He Lys Asp Lys Glu Ala Lys Glu Lys Glu Lys Ala Asn Lys Phe Tyr 

130 135 140 

Thr Glu Ser Glu Glu Gin 
145 150 



(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



Met Ser lie Glu Pro Arg Arg Arg Trp Arg Asn Ser Thr Val Cys Leu 

15 10 15 

Glu Ala Glu Leu Tyr Gin Glu Ser Leu Val Leu lie Gly Gly Asp Pro 

20 25 30 

Gly He Gly Lys Ser Thr Leu Leu Leu Gin Val Ser Thr Gin Leu Ser 

35 40 45 

Gin Val Gly Thr Val Leu Tyr Val Ser Gly Glu Glu Ser Ala Gin Gin 

50 55 60 

He Lys Leu Arg Ala Glu Arg Leu Gly Asp Phe Asp Ser Glu Phe Tyr 
65 70 75 80 

Leu Tyr Ala Glu Thr Asn Met Gin Ser Val Arg Ala Glu Val Val Ala 

85 90 95 

Tyr Pro Ala Arg Leu Ser His Tyr 
100 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

Met Ala Thr Lys Gin Lys Glu Val Thr Thr Phe Asp Val Gin Val Ala 

15 10 15 

Glu Phe He Arg Asn His Lys Gin Lys Gly Thr Ala Thr Asp Asp Glu 

20 25 30 

He Asn Ala Ser Leu Val He Pro Phe Thr Leu Asp Ala Asp Gly He 

35 40 45 

Glu Asp Leu Leu Gin Arg He Gin Asp Ala Gly He Ser He Thr Asp 

50 55 60 

Asn Glu Gly Asn Pro Ser Ala Arg Val Leu Ser Asn Glu Glu Glu Pro 
65 70 75 80 

Glu Leu Ser Asp Glu Asp Leu He Gly Ser Thr Ser Ala Lys Val Asn 

85 90 95 

Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Val Val Pro Leu Leu 
100 105 110 
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Thr Asn Glu Glu Glu Lys Glu Leu Ala Leu Ala Val Glu Ala Gly Asp 

115 120 125 

lie Glu Ala Lys Gin Arg Leu Ala Glu Ala Asn Leu Arg Leu Val Val 

130 135 140 

Ser He Ala Lys Arg Tyr Val Gly Arg Gly Xaa Gin Ser Leu Thr 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 12: 

Met Met Leu Lys Pro Ser He Asp Thr Leu Leu Asp Lys Val Pro Ser 

15 10 15 

Lys Tyr Ser Leu Val lie Leu Glu Ala Lys Arg Ala His Glu Leu Glu 

20 25 30 

Ala Gly Ala Pro Ala Thr Gin Gly Phe Lys Ser Glu Lys Ser Thr Leu 

35 40 45 

Arg Ala Leu Glu Glu lie Glu Ser Gly Asn Val Thr lie His Pro Asp 

50 55 60 

Pro Glu Gly Lys Arg Glu Ala Val Arg Arg Arg lie Glu Glu Glu Lys 

V 70 75 80 

Arg Arg Lys Glu Glu Glu Glu Lys Lys lie Lys Glu Gin lie Ala Lys 

85 90 95 

Glu Lys Glu Asp Gly Glu Lys He 
100 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 



Met lie Arg Gin Gly Gin Met Leu Glu Tyr Ala Glu Tyr Val Gly Asn 

15 10 15 

Tyr Tyr Gly Thr Pro Leu Thr Tyr Val Asn Glu Thr Leu Asp Lys Gly 

20 25 30 

He Asp Val Phe Leu Glu He Glu Val Gin Gly Ala Leu Gin Val Lys 

35 40 45 

Lys Lys Val Pro Asp Ala Val Phe He Phe Leu Thr Pro Pro Asp Leu 

50 55 60 

Asp Glu Leu Gin Glu Arg Leu Val Gly Arg Gly Thr Asp Ser Ala Glu 
65 70 75 80 

Val He Ala Gin Arg He Glu Lys Ala Lys Glu Glu He Ala Leu Met 

85 90 95 

Arg Glu Tyr Asp Tyr Ala He Val Asn Asp Gin Val Pro Leu Ala Ala 

100 105 110 

Glu Arg Val Lys Cys Val He Glu Ala Glu His Phe Cys Val Asp Arg 

115 120 125 

Val He Gly His Tyr Gin Glu Met Leu Pro Lys Ser Pro Thr Thr Arg 
130 135 140 



(2) INFORMATION FOR aZQ ID NO: 314: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:314: 



Met Ser Arg Leu Leu Val He Gly 

1 5 
He Ser Lys He Cys Gin Asp Ser 
20 

Ala Ser Arg Thr Lys Ser Lys Cys 

35 40 
Gly Lys Thr Ser Thr Lys He Glu 

50 55 
Val Glu Glu Val He Ala Leu He 
65 70 
Phe 



Cys Gly Gly Val Ala Gin Val Ala 

10 15 
Glu Thr Phe Thr Glu He Met He 
25 30 
Asp Asp Leu Lys Ala Lys Leu Glu 
45 

Thr Ala Ala Leu Asp Ala Asp Lys 
60 

Glu Ser Leu Gin Thr Ser Lys Leu 
75 80 
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(2) INFORMATION FOR SEQ ID NO:315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 15: 

Met His Glu Ser Phe His Val Asp Asp Pro Thr Leu Tyr Ser Arg Glu 

15 10 15 

Trp Phe Ser Trp Ala Asn Met Met Phe Cys Glu Leu Val Leu Asp Tyr 

20 25 30 

Leu Asp lie Arg 
35 

C2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 

Met Arg Phe Tyr Phe Met Glu Asn Val Val Val His lie He Ser His 

15 10 15 

Ser His Trp Asp Arg Glu Trp Tyr Leu Pro Phe Glu Ser His Arg Met 

20 25 30 

Gin Leu Val Glu Leu Phe Asp Asn Leu Phe Asp Leu Phe Glu Asn Asp 

35 40 45 

Pro Glu Phe Lys Ser Phe His Leu Asp Gly Gin Thr He Val Leu Asp 
50 55 60 

Asp 
65 



249 



WO 97/37026 

(2) INFORMATION FOR SEQ ID NO: 317: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 17: 

Met Leu Leu Tyr lie Leu Ser His lie Ala Thr Gly lie Val Ser Gly 

15 10 15 

Thr Cys Leu Leu Lys Ala lie Val Cys Asn Trp Trp Asn Cys Leu Thr 

20 25 30 

lie Ser Leu lie Ser Leu Lys Met Thr Leu Ser Ser Arg Val Ser Thr 

35 40 45 

Trp Met Asp Lys Leu Leu Ser Leu Thr Thr Asn Leu Gin lie Arg Pro 

50 55 60 

Glu Asn Arg Asp Lys Val Gin Arg Tyr lie Asp Glu Gly Lys Leu Lys 
65 70 75 80 

lie Gly Pro Phe Tyr lie Leu Gin Asp Asp Tyr Leu lie Ser Ser Glu 

85 90 95 

Ala Asn Val Arg Asn Thr Leu He Gly Gin Gin Glu Ala Ala Lys Trp 

100 105 110 

Gly Lys Ser Thr Gin He Gly Tyr Phe Pro Asp Thr Phe Gly Asn Met 

115 120 125 

Gly Gin Ala Pro Gin He Leu Gin Lys Ser Gly He His Val Ala Ala 

130 135 140 

Phe Gly Arg Gly Val Lys Pro He Gly Phe Asp Asn Gin Val Leu Glu 
145 150 155 160 

Asp Glu Gin Phe Thr Ser Gin Phe Ser Glu Met Tyr Trp Gin Gly Val 

165 170 175 

Asp Gly Ser Arg Val Leu Gly He Leu Phe Ala Asn Trp Tyr Ser Asn 

180 185 190 

Gly Asn Glu lie Pro Val Asp Lys Asp Glu Ala Leu Thr Phe Trp Lys 

195 200 205 

Gin Lys Leu Ser Asp Val Arg Cys Leu Arg Phe Asp Gin Pro Met Val 

210 215 220 

Asp Asp Glu Thr Ala Val Asn Thr Ser Leu Ser Gin Lys Lys Ser Glu 
225 230 235 240 

Arg Ser His Ser Val Trp Gin Met Asn Ser Ser Arg Met 
245 250 
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(2) INFORMATION FOR SEQ ID NO: 318: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Thr He Val Gly Cys Arg He Asp Gly Arg Leu He His Gly Gin 

Val Ala Asn Leu Trp Ala Gly Lys Leu Asn Val Ser Arg He Met Val 

20 25 30 

Val Asp Asp Glu Val Val Asn Asn Asp He Glu Lys Ser Gly Leu Lys 

35 40 45 

Leu Ala Thr Pro Pro Gly Val Lys Leu Ser He Leu Pro Val Glu Lys 

50 55 60 

Ala Ala Ala Asn He Leu Ala Gly Lys Tyr Asp Ser Gin Arg Leu 'he 
65 70 75 80 

He Val Ala Arg Lys Pro Asp Arg Phe Pro Trp Phe Gly Arg Ser Arg 
85 90 95 

Cys Thr Thr 



(2) INFORMATION FOR SEQ ID NO:319: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 157 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met He Gin His Pro Arg He Gly He Arg Pro Thr He Asp Gly Arg 

15 10 15 

Arg Gin Gly Val Arg Glu Ser Leu Glu Val Gin Thr Met Asn Met Ala 
20 25 30 
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Lys Ser Val Ala Asp Leu lie Ser Ser Thr Leu Lys Tyr Pro Asp Gly 

35 40 45 

Glu Pro Val Glu Cys Val lie Ser Pro Ser Thr He Gly Arg Val Pro 

50 55 60 

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala 
65 70 75 80 

Thr He Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp 

85 90 95 

Met Ser Pro Asp He Pro His Ala He Trp Gly Phe Asn Gly Thr Glu 

100 105 no 

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Ala Gin 

115 120 125 

Lys Gly He Pro Ala Phe Gly He Tyr Gly Arg Asp Val Gin Glu Ala 

130 135 140 

Asn Asp Thr Asp He Pro Glu Asp Val Lys Glu Asn Phe 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Leu Leu Ser Thr Lys Gly He Glu Glu Gin Glu He Ala Arg Pro 

15 10 15 

Thr Leu Glu Arg Leu Phe Ser Met Arg Glu Asn Tyr Lys Val Thr Gly 

20 25 30 

Arg His Pro Gly Tyr Arg Lys Tyr Asn Gly Asp Gly Ser Met Lys Glu 

35 40 45 

Thr Glu Lys 
50 

(2) INFORMATION FOR SEQ ID NO:321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Leu Thr Leu Thr Pro Asp Val lie Tyr Gin Glu Pro Asp He Leu 

15 10 15 

Tyr Gin His Glu Asp Phe Ser Leu Val Lys He Arg Gin He Arg Phe 
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

Met Ser Ser Val Val Gly Leu Met Gly Asn lie Gly Gin Ala Asn Tyr 

15 10 is 

Ala Ala Ser Lys Ala Gly Leu He Gly Phe Thr Lys Ser Val Ala Arg 

20 25 30 

Glu Val Ala Ser Arg Asn lie Arg Val Asn Val He Ala Pro Gly Asn 
35 40 45 

Asp 



(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 
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Met Lys Leu Glu His Lys Asn lie Phe lie Thr Gly Ser Ser Arg Gly 

15 10 15 

lie Gly Leu Ala lie Ala His Lys Phe Ala Gin Ala Gly Ala Asn lie 

20 25 30 

Val Leu Asn Ser Arg Gly Ala lie Ser Glu Glu Leu Leu Ala Glu Phe 

35 40 45 

Ser Asn Tyr Gly lie Lys Val Val Pro lie Ser Gly Asp Val Ser Asp 

50 55 60 

Phe Ala Asp Ala Lys Arg Met lie Asp Gin Ala lie Ala Glu Leu Gly 
65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly lie Thr Gin Asp Thr Leu 

85 90 95 

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn 

100 105 110 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Phe Gly Asn Arg 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

Met Glu Ser Met Pro Arg lie Gly Leu Leu Val Thr Val Glu Asn Arg 

15 10 15 

Asp Thr Glu Thr lie Phe Asn Ala Ala Gly Leu Asp Phe Asp Val Leu 

20 '25 30 

Lys Ala Ser Ala lie Ala Tyr lie Asn Ala Asn Thr Phe Val Gin Lys 

35 40 45 

Glu Asn Ala Gly Glu Met Gly Arg Ser Val Ser Tyr His Asp Met Arg 

50 55 60 

Ser Val 
65 

(2) INFORMATION FOR SEQ ID NO: 325 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 



Met Arg Gin Gly Gin Gly Ser Val Glu Ala lie Phe Asn Ala He Asp 

15 10 15 

Lys Phe Phe Asn Gin Ser Val Arg Leu Val Ser Tyr Thr He Asn Ala 

20 25 30 

Val Thr Asp Gly He Asp Ala Gin Asp Arg Val Val Gly His Cys 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: si .gle 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gin Leu Gin 

15 10 15 

Thr His Ala Asp Asn Asp He Glu Ala Leu Val Ser Leu Ala Asn Met 

20 25 30 

Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Thr Gly Phe Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 327: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:327: 

Met Asn Phe Gin Leu Ala Lys Tyr Ser Leu Leu Lys Lys Phe Ser Glu 

15 10 15 

Asn lie Gly Phe Thr Thr Pro Glu Glu Cys Gly Ala lie Phe Gin Tyr 

20 25 30 

Leu lie Glu Asn Val Gin Thr Asp Arg Gin lie lie Tyr Ser Pro Pro 

35 40 45 

Cys His Asp Glu Leu Arg Met Ala Val Ala Asn Ser Leu Ala Ala Val 

50 55 €0 

Lys Asn Gly Ala Gly Leu Phe Glu Glu Thr lie His Gly lie Arg Glu 
65 70 75 80 

Arg Ala Glu Asn Ala Ala Leu Glu Glu lie Ala Val Ala Leu Asn lie 

85 90 95 

Arg Gin Asp Tyr Tyr Gin Val Glu Thr Ser lie Val Leu Asn Glu Thr 

100 105 110 

He Asn Thr Ser Glu Met Val Ser Arg Phe Ser Gly He Pro Val Pro 

115 120 125 

Lys Asn Lys Ala Val Val Gly Gly Asn Thr Phe Ser His Glu Ser Gly 

130 135 140 

He His Gin Asp Gly Val Leu Lys Asn Pro Leu Thr Tyr Glu He He 
145 150 155 160 

Thr Pro Glu Leu Val Gly Val Lys He Pro Leu Gly Lys Leu Ser Gly 

165 170 175 

Arg His Ala Phe Val Glu Lys Leu Arg Glu Leu Ala Leu Asp Phe Thr 

180 185 190 

Glu Glu Asp He Lys Pro Phe Phe Ala Lys Phe Lys Ala Leu Ala Asp 
195 200 205 

Lys Lys 
210 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
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Met Ser Gin Gin Val Lys Asn Ala His Asn Leu Tyr He His Ala He 

1 5 io 15 

Gin Asp Gly Arg Val Ala Glu Ala Gin Ala Gin Ser Val Gly Asp Thr 

20 25 30 

Tyr He Gin His Ser Thr Gly Val Pro Asp Gly Lys Glu Gly Phe Ala 

35 40 45 

Ala Phe Phe Ala Asp Phe Phe Glu Arg His Pro Glu Arg Gin lie Lys 
50 55 



60 



He Val Arg Thr He Glu Asp Gly Asn Leu Val Phe Val His Val 



65 



70 



75 



Pro 
80 



Ser He Ser Glu Trp Trp Arg Ser Ser Met Gly Asp Asp Gly Tyr Phe 
85 90 

Pro Cys Gly 



95 



(2) INFORMATION FOR SEQ ID NO:329: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRAT JEDNESS ; single 

(D) TOPf jOGY : linear 

(iij MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 
Met Glu Met Gin Thr Ser Ala Gin Leu Leu Thr Asn Lys He Phe Leu 

Tyr 



5 io 15 

Lys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly lie Glu 



20 25 30 

Glu Glu Phe Thr Asn Pro Trp His Ala Ala He Ser Ser Phe Val Ala 

35 40 45 

Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val Thr He Phe Pro 

50 55 60 

Ser Glu Tyr Arg He Pro Ala Thr Val Leu lie Val Gly Val Ala Leu 



65 



70 7 c 

75 80 



Leu Leu Thr Gly Tyr Thr Ser Ala Arg Leu Gly Lys Asp Pro Thr 



85 



Arg 



90 



95 



Thr Ala Met lie Arg Asn Leu Ala lie Gly Leu Leu Thr Met Gly Val 

100 105 "0 

Thr Phe Leu Leu Glu Gin Leu Phe Ser He 
"5 120 
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(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Ala Phe Phe Leu Cys Pro His Phe Arg Ser Asn His Trp Lys lie 

15 10 15 

Leu Thr Val Ser Asp Thr Met Glu Glu Lys Arg Leu Glu Tyr Pro Met 

20 25 30 

Val Thr Phe Leu Gly Asn Pro Val Ser Phe Thr Gly Lys Gin Leu Gin 

35 40 45 

Val Gly Asp Lys Ala Leu Asp Phe Ser Leu Thr Thr Thr Asp Leu Ser 

50 55 60 

Lys Lys Ser Leu Ala Asp Phe Asp Gly Lys Lys Lys Val Leu Ser Val 
65 70 75 B0 

Val Pro Ser lie Asp Thr Gly He Cys Ser Thr Gin Thr Arg Arg Phe 

85 90 95 

Asn Glu Glu Leu Ala Gly Leu Asp Asn Thr Val Val Leu Thr Val Ser 

100 105 110 

Met Asp Leu Pro Phe Ala Gin Lys Arg Trp Cys Gly Ala Glu Gly Leu 

115 120 125 

Asp Asn Ala He Met Leu Ser Asp Tyr Phe Asp His Ser Phe Gly Arg 

130 135 140 

Asp Tyr Ala Leu Leu He Asn Glu Trp His Leu Leu Ala Arg Ala Val 
145 150 155 160 

Phe Val Leu Asp Thr Asp Asn Thr He Arg Tyr Val Glu Tyr Val Asp 



165 



170 



175 



Asn He 



Asn 



Ser 



Glu 



Pro 



Asn 



Phe 



Glu Ala Ala He Ala Ala 



Ala 



Lys 



180 



185 



190 



Ala Leu 



(2) INFORMATION FOR SEQ ID NO: 331: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Met Gly Tyr Arg Pro Ser Thr Ala Asn Ala He He His Gin Val Arg 

15 10 15 

Glu Leu Leu Val Ser Arg Gly Tyr Thr Phe Tyr Asn Arg Lys Arg Leu 

20 25 3 o 

Met Val Val Pro Lys Ser Val Val Lys Glu Leu Leu Gly Met Glu Leu 
35 40 45 

<2) INFORMATION FOR SEQ ID NO:332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 : 

Met Gin Ala Val Glu His Phe lie Lys Gin Phe Val Pro Glu His Tyr 

Asp Leu Phe Leu Asp Leu Ser Arg Glu Thr Lys Thr Phe Ser Gly Lys 

20 25 30 

Val Thr lie Thr Gly Gin Ala Gin Ser Asp Arg lie ser Leu His Gin 

35 40 45 

Lys Asp Leu Glu He Thr Ser Val Glu Val Ala Gly Gin Ala Arg Pro 

50 55 60 

Phe Thr Val Asp His Asp Asn Glu Ala Leu His lie Glu Leu Ala Glu 
" 70 75 80 

Ala Gly Gin Val Glu Leu Val Leu Ala Phe Ser Gly Lys lie Thr Asp 

85 90 95 

Asn Met Thr Gly He Tyr Pro Ser Tyr Tyr Thr Val Asp Gly Val Lys 

ioo 105 no 

Lys Glu Val Leu Ser Thr Gin Phe Glu Ser His Phe Ala Arg Glu Ala 
115 120 125 
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Phe Pro Cys Val Asp Glu Pro Glu 

130 135 
Leu Arg Phe Asp Gin Ala Glu Gly 
145 150 
Glu lie Asp Val Glu Asn Arg Lys 
165 

Thr Thr Pro Arg Met Ser Ser Tyr 
180 

Leu Gin Gly Val Thr Ala Lys Thr 
195 200 
Leu Leu Asn Gin Ser Thr Ser Thr 
210 215 
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Ala Lys Ala Thr Phe Asp Leu Ser 
140 

Glu Leu Ala Leu Ser Asn Met Pro 
155 160 
Glu Thr Gly He Trp Lys Phe Glu 

170 175 
Leu Leu Ala Phe Val Ala Gly Asp 
185 190 
Lys Asn Gly Thr Leu Val Gly Cys 
205 

Phe Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:333: 



Met Ser Gin Glu Phe Leu Ala Arg He Leu Glu Gin Lys Ala Arg Glu 

15 10 15 

Val Glu Gin Met Lys Leu Glu Gin He Gin Pro Leu Arg Gin Thr Tyr 

20 25 30 

Arg Leu Ala Glu Phe Leu Lys Asn His Gin Asp Arg Leu Gin Val He 

35 40 45 

Ala Glu Val Lys Lys Ala Ser Pro Ser Phe Gly Glu He Ser He Ser 

50 55 60 

Met Trp He Leu Cys Asn Arg Pro Arg Leu Met Lys Lys Thr Glu Gin 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 334: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 



Met Ser Ser Phe Thr Pro Glu Asp 

1 5 
Glu Asp He Arg Gly Gly Asn Ala 
20 

Ser Val Leu Lys Asn Glu Ala Ser 

35 40 
Asn Ala Gly Leu Gly Phe Tyr Ala 

50 55 
Glu Gly Val Ala Leu Ala Arg Gin 
65 70 
Glu Lys Leu Arg Leu Leu Gin Glu 
85 



Leu Gly Met Glu Gly Tyr Ala Met 

10 15 
Gin Glu Asn Ala Glu He Leu Leu 
25 30 
Pro Phe Leu Glu Thr Thr Val Leu 
45 

Asn Gly Lys He Asp Ser He Lys 
60 

Val He Ala Arg Gly Lys Ala Leu 

75 80 
Tyr Gin Lys 
90 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 



Met lie Tyr Thr Val Thr Leu Asn Pro Ser lie Asp Tyr lie Val Arg 

15 10 15 

Leu Asp Gin Val Lys Val Gly Ser Val Asn Arg Met Asp Ser Asp Asp 

20 25 30 

Lys Phe Ala Gly Gly Lys Gly lie Asn Val Ser Arg Val Leu Lys Arg 

35 40 45 

Leu Asn lie Ser Asn Thr Ala Thr Gly Phe lie Gly Gly Phe Thr Gly 

50 55 60 

Lys Phe Xaa Xaa Asp Thr Leu Ala Glu Glu Glu He Glu Xaa Arg Phe 
65 70 75 80 

Val Gin Val Ala Glu Asp Thr Arg lie Asn Val Lys lie Xaa Ala Asp 

85 90 95 

Gin Glu Thr Glu He Asn Gly Thr Gly Pro Thr Val Glu Pro Val Lys 
1°° 105 no 
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Leu Glu Glu Leu Lys Ala He Leu Ser Ser Leu Thr Ala Glu Asp Thr 

115 120 125 

Val Val Phe Ala Gly Ser Ser Ala Lys Asn Leu Gly Asn Val He Tyr 

130 135 140 

Lys Gly Phe Asn Leu Leu Asp Ala Pro Asp Trp Cys Ala Ser Gly Leu 
"5 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 336: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



Met Met Gin Val Tyr Phe Asp Gin Gly He Tyr Asn Lys Lys Ala Val 

15 10 15 

Phe Glu Val Tyr Phe Arg Gin Gin Pro Ph .. Lys Asn Gly Tyr Ala Val 

20 25 30 

Phe Ala Gly Leu Glu Arg He Val Asn Tyr Leu Glu Asp Leu Arg Phe 

35 40 45 

Ser Asp Ser Asp lie Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly Ala 

50 55 60 

Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg Ser 
65 70 75 80 

Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro lie Val Gin Val 

85 90 95 

Glu Gly Pro Leu Ala Gin Cys Gin Leu Val Glu Thr Ala Leu Leu Asn 

100 105 HO 

He Val Asn Tyr Gin Thr Leu Val Ala Thr Lys Ala Ala Pro Tyr Pro 

115 120 125 

Phe Gly Tyr Arg Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:337: 

Met Lys He He lie Gin Arg Val Lys Lys Ala Gin Val Ser He Glu 

15 10 15 

Gly Gin He Gin Gly Lys He Asn Gin Gly Leu Leu Leu Leu Val Gly 

20 25 30 

Val Gly Pro Glu Asp Gin Glu Glu Asp Leu Asp Tyr Ala Val Arg Lys 

35 40 45 

Leu Val Asn Met Arg He Phe Ser Asp Val Glu Gly Lys Met Asn Leu 

50 55 60 

Ser Val Lys Asp He Glu Gly Glu He Leu Ser He Leu Ser Leu Pro 
65 70 75 80 

Leu Cys Gly Tyr 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met He Lys Gly Thr lie Lys Thr Val Ser lie Met Ala Ala Thr Ser 

15 10 15 

Gly Phe Leu Leu Tyr Asn Glu Val Phe Phe Leu Thr Asn Gly Ala Ala 

20 25 30 

Gly Thr Lys Ser He Ser Phe Val He Arg Glu Leu Ala Val Ala Ser 

35 40 45 

Ser Arg Thr Gin Tyr Ala Arg Ala Asn Thr He Gly Val He Gin He 

50 55 60 

Leu Gly Gly Met Leu He He Val Cys He Asn He Leu Phe Arg Glu 

65 7 ° 75 so 

Arg Lys Arg Leu Lys Gly Gly Lys 
85 
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(2) INFORMATION FOR SEQ ID NO: 339: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:339: 

Met Gly Phe Gly He Pro Ala Ala He Gly Ala Lys He Ala Asn Pro 

15 10 15 

Asp Lys Glu Val Val Leu Phe Val Gly Asp Gly Gly Phe Gin Met Thr 

20 25 30 

Asn Gin Glu Leu Ala He Leu Asn He Tyr Lys Val Pro He Lys Val 

35 40 45 

Val Met Leu Asn Asn His Ser Leu Gly Met Val Arg Gin Trp Gin Glu 

50 55 60 

Ser Phe Tyr Glu Gly Arg Thr Ser Glu Ser Val Phe Asp Thr Leu Pro 
65 70 75 80 

Asp Phe Gin Leu Met Ala Gl.. Ala Tyr Gly He Lys Asn Tyr Lys Phe 

85 90 95 

Asp Asn Pro Glu Thr Leu Ala Gin Asp Leu Glu Ala Thr Thr Glu Asp 

100 105 110 

Val Pro Met Leu He Glu Val Asp He Ser Arg Lys Glu Gin Val Leu 

115 120 125 

Pro Met Val Pro Ala Gly Lys Ser Asn His Glu Met Leu Gly Val Lys 

130 135 140 

Phe His Ala 
145 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340: 
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Met Trp Leu Pro Leu Asp Arg Asn Asn Lys Gly Gin Asn He Ala Gly 

Ala Arg Gin Ala Ala Glu Gly He Phe Gly Val Asp Ala Ser Gin Leu 

20 25 3 0 

Thr Val Pro Gin Ala Ala Phe Leu Ala Gly Leu Pro Gin Ser Pro lie 

35 40 45 

Thr Tyr Ser Pro Tyr Glu Asn Thr Gly Glu Leu Lys Asn 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 341: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 : 

Met Tyr Leu Gly Asp Leu Met Glu Lys Ala Glu Cys Gly Gin Phe Ser 

lie Leu Ser Phe Leu Leu Gin Glu Ser Gin Thr Thr Val Lys Ala Val 

20 25 30 

Met Glu Glu Thr Gly Phe Ser Lys Ala Thr Leu Thr Lys Tyr Val Thr 

35 40 45 

Leu Leu Asn Asp Lys Ala Leu Asp Ser Gly Leu Glu Leu Thr lie His 

50 55 60 

Ser Glu Asp Glu Asn Leu Arg Leu Ser lie Gly Ala Ala Thr Lys Gly 



65 70 75 



80 

95 



Arg Asp He Pro Glu Leu Val Phe Trp lie Val Leu Leu Asn Thr Arg 

85 90 
Phe Trp Phe lie Phe Ser Thr Thr Asn Ser Phe 
100 105 

(2) INFORMATION FOR SEQ ID NO:342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:342: 

Met Lys Glu lie lie Glu Lys Leu Ala Lys Phe Glu Asn Leu Ser Gly 

15 10 15 

Val Glu Met Thr Asp Val He Glu Arg He Val Thr Gly Arg Val Thr 

20 25 30 

Glu Ala Gin He Ala Ser Leu Leu Leu Ala Leu Lys Met Lys Gly Glu 

35 40 45 

Thr Pro Glu Glu Arg Thr Ala He Ala Gin Val Met Arg Gly His Ala 

50 55 60 

Gin His He Pro Thr Glu lie His Asp Ala Met Asp Asn Cys Gly Thr 
65 70 75 80 

Gly Gly Asp Lys Ser Phe Ser Phe Asn He Ser Thr Thr Ala Ala Phe 

85 90 95 

Val Leu Ala Gly Gly Gly He His Met Ala Lys His Gly Asn Arg Ser 

100 105 110 

He Ser Ser Lys Ser Gly Ser Ala Asp Ser Xaa Asn Leu Gly Asn Gin 
115 120 125 

Ser 



(2) INFORMATION FOR SEQ ID NO: 343: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:343: 

Met Asn Val Gin Met Ser Ser Lys Thr Asn He Leu Arg Ala His Ala 

15 10 15 

Glu Met Gin Asn He Gin Arg Arg Ala Asn Glu Glu Arg Gin Asn Leu 

20 25 30 

Gin Arg Tyr Arg Ser Gin Asp Leu Ala Lys Ala He Leu Pro Ser Leu 

35 40 45 

Asp Asn Leu Glu Arg Ala Leu Ala Val Glu Gly Leu Thr Asp Asp Val 
50 55 60 
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Lys Lys Gly Leu Gly Met Val Gin 
65 70 
Glu Glu Gly lie Glu Glu He Ala 
85 

Tyr His Met Ala He Gin Thr Leu 
100 

Asp Thr He Ala Gin Val Phe Gin 
115 120 
He Leu Arg Pro Ala Met Val Val 
130 135 
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Glu Ser Leu He His Ala Leu Lys 

75 80 
Ala Asp Gly Glu Phe Asp His Asn 

90 95 
Pro Ala Asp Asp Asp His Pro Val 
105 no 
Lys Gly Tyr Lys Leu His Asp Arg 
125 

Val Tyr Asn 



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 344: 



Met Ser Asn Lys Ser Tyr Ser Lys 

1 5 
Tyr Gly Phe He Gly Leu Lys Leu 
20 

Val Ser Glu Glu Phe His Ser Glu 

35 40 
Tyr Asp He Ser Val 
50 



Phe Gin Glu Glu Val Ser Leu Lys 

10 15 
Asp Lys Leu Ser Leu Thr Ala Glu 
25 30 
He Leu Ser Gly Asn Phe Thr Leu 
45 



(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
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Met His Phe Asp Lys Ser Lys Phe Gly Ala Val Phe Ser Ala Pro Gly 

15 10 15 

Leu Tyr Glu Val Glu Val lie Asn Asn Ala Ser Phe Gly Gin Asn Ala 

20 25 30 

Gin Tyr Glu Val lie Gin Ser Arg Lys Leu Gly Thr Phe Ala Glc Leu 

35 40 45 

lie Glu Met Ala Lys He Lys 
50 55 

(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS : 
<A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

M it He Gin Ala Val Phe Glu Arg Ala Glu Asp Gly Glu Leu Arg Ser 

15 10 15 

Ala Glu He Thr Gly His Ala Glu Ser Gly Glu Tyr Gly Leu Asp Val 

20 25 30 

Val Cys Ala Ser Val Ser Thr Leu Ala He Asn Phe He Asn Ser He 

35 40 45 

Glu Lys Phe Ala Gly Tyr Glu Pro He Leu Glu Leu Asn Glu Asp Glu 

50 55 60 

Gly Gly Tyr Leu Met Val Glu He Pro Lys Asp Leu Pro Ser His Gin 
65 70 75 80 

Arg Glu Met Thr Gin Leu Phe Phe Glu Ser Phe Phe Leu Gly Met Ala 

85 90 95 

Asn Leu Ser Glu Asn Ser Ser Glu Phe Val Gin Thr Arg Val He Thr 
100 105 110 

Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met Thr Ala He Ser Met Lys Trp Leu Asn Asp Tyr He Trp Pro Ala 

15 10 15 

Glu Ser Glu Phe Thr Pro Asp Met Thr Thr Asn Ala Val Lys Glu Ala 

20 25 30 

Leu Thr Glu Met Leu Gin Ser Gly Thr Thr Thr Phe Asn Asp Met Tyr 

35 40 45 

Asn Pro Asn Gly Val Asp lie Gin Gin He Tyr Gin Val Val Lys Thr 

50 55 60 

Ser Lys Met Arg Cys Tyr Phe Ser Pro Thr Leu Phe Ser Ser Glu Thr 
65 70 75 80 

Glu Thr Thr Ala Glu Thr lie Ser Arg Thr Arg Ser lie lie Asp Glu 

85 90 95 

He Leu Lys Tyr Lys Asn Pro Lys Phe Gin Gly Leu Trp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
Met Val Arg Glu Ser Ala Glu Ser Ala Gly Phe Phe Leu Glu Thr His 

Asp 



10 15 
Met Val Gin Gly Glu Trp Asn Thr Cys Val Phe Lys Lys Thr Lys 

20 25 30 

He Ser Gly Val He Gly Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 3 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 



Met Thr Val Asp Arg Glu Gly Phe 

1 5 
Glu Arg Ala Arg Ala Ser Ala Val 
20 

Asn Gly Asn Ser Ser Lys His His 
35 40 

Cys 



Glu Ala Ala Met Lys Glu Gin Gin 

10 15 
Lys Gly Gly Ser Met Gly Met Gin 
25 30 
Cys Arg Lys Cys Leu Gin Leu Gin 
45 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 



Met Lys Ala Gin Leu Val Pro Ser Arg Ser Lys Tyr Leu lie Asn Asp 

Asn Thr Val Val Leu Phe Phe Gly Thr Arg Thr Asp Tyr Thr Arg Lys 

20 25 30 

Tyr Asp Leu Asp Leu Val Arg Glu Val Ala Gly Asp Gin lie Ala Arg 

35 40 45 

Arg Val Val Leu Leu Ser Asp Gin Ala Phe Gly Leu Glu Asn Val ' Lys 

5 ° 55 60 

Glu Val Ala Leu Gly Cys Gly Gly Val Leu Asn Asp lie Tyr Arg Val 
65 70 75 80 

Phe Pro Tyr lie Val Tyr Ala Gin Leu Phe Ala Leu Leu Thr Ser Leu 

85 90 95 

Lys Val Glu Asn Lys Pro Asp Thr Pro Ser Pro Thr Gly Thr Val Asn 
100 105 no 
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Arg Val Val Gin Gly Val He He His Glu Tyr Gin Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 



Met Ala Val Asn Asn Glu He Gly Ser He Gin Xaa He Glu Ala He 

15 10 15 

Ser Glu Phe Leu Ala Asp Lys Pro Thr He Ser Phe His Val Asp Ala 

20 25 30 

Val Gin Ala Leu Ala Lys He Pro Thr Glu Lys Tyr Leu Thr Glu Arg 

35 40 45 

Val Asp Cys Ala Thr Phe Ser Ser Hir Lys Phe His Gly Val Arg Gly 

50 55 60 

Val Gly Phe Val Tyr He Lys Ser Gly Lys Lys He Thr Pro Leu Leu 
65 70 75 80 

Thr Gly Gly Gly Gin Glu Arg Asp Tyr Arg Ser Thr Thr Glu Asn Val 

85 90 95 

Ala Gly He Ala Ala Thr Ala Lys Ala Leu Arg Leu Ser Met Glu Lys 

100 105 no 

Leu Asp He Phe Arg Ser Lys Thr Gly Gin Met Lys Ala Val He His 

115 120 125 

Gin Ala Leu Leu Asn Tyr Pro Asp He Phe Val Phe Ser Asp Glu Glu 

130 135 140 

Asp Phe Ala Pro His He Leu Thr Phe Gly lie Lys Gly Val Arg Gly 
145 150 155 leo 

Glu Val He Val His Ala Phe Glu Asp Tyr Asp He Phe lie Ser Thr 

165 170 175 

Thr Ser Ala Cys Ser Ser Lys Ala Gly Lys Pro Ala Gly Thr Leu He 

180 185 190 

Ala Met Gly Val Asp Lys Asp Lys Ala Lys Ser Ala Val Arg Leu Ser 

195 200 205 

Leu Asp Leu Glu Asn Asp Met Ser Gin Val Glu Gin Phe Leu Thr Lys 
210 215 220 
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Leu Lys Leu lie Tyr Asn Gin Thr Arg Lys Val Arg 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Met Leu Val Thr Gly Ala lie Leu Gly Val Asn Val His lie Phe Ser 

15 10 15 

Pro Lys Glu Leu Phe Pro Glu Lys Glu lie Val Glu Leu Ala Glu Gly 

20 25 30 

Phe Ala Lys Glu Ser Gly Ala His Val Leu lie Thr Glu Asp Ala Asp 

35 4C 45 

Glu Ala Val Lys Asp Ala Asp Val Leu Tyr Thr Asp Val Trp Val Ser 

50 55 60 

Met Gly Glu Glu Asp Lys Phe Ala Glu Arg Val Ala Leu Leu Lys Pro 
65 70 75 80 

Tyr Gin Val Asn Met Asp Leu Val Lys Lys Ala Gly Asn Glu Asn Leu 

85 90 95 

lie Phe Leu His Cys Leu Pro Ala Phe His Asp Thr His Thr Val Tyr 

100 105 110 

Gly Lys Asp Val Ala Glu Lys Phe Gly Val Glu Glu Met Glu Val Thr 

115 120 125 

Asp Glu Val Phe Arg Ser Lys Tyr Ala Arg His Phe Asp Gin Ala Glu 

130 135 140 

Asn Arg Met His Thr lie Lys Ala Val Met Ala Ala Thr Leu Gly Asn 
145 150 155 160 

Leu Tyr lie Pro Lys Val 
165 

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

Met Arg Leu Glu Gin Asp Cys Pro Val Phe Leu Lys lie Lys Glu Lys 

15 10 15 

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu 

20 25 30 

Glu Lys Ser lie Ala Phe Tyr Gin Asp Ala Phe Gly Phe Lys Glu Leu 

35 40 45 

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr lie Val Tyr Leu Gly 

50 55 60 

Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His 
65 70 75 80 

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His lie Ala Leu Ser Thr 

85 90 95 

Pro Asp Leu Glu Ala Leu His Gin Glu His Ser Thr Lys Gly Tyr Glu 

100 105 110 

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe 

115 120 125 

Val Lys Asp Pro Asp ;ly Tyr Lys Val Glu Val lie Arg Glu Lys 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 3 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 173 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 

Met Asp His Thr lie Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp 

15 10 15 

Leu Glu Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro 

20 25 30 

Phe Gly Pro Leu Phe Gly Lys He Lys Asn Gly Gin Asp Leu Val Leu 
35 40 45 
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Glu Asp Gly Thr Glu 
50 

Pro Gly Lys He He 
65 

Ser Val Arg Leu Ala 
85 

Tyr Gly Lys Gly Asp 
100 

Asn Met Gin Ala Ala 
115 

Leu Leu Asn His He 
130 

Leu Lys Lys Asp Ala 
145 

Asp Leu Glu Lys Met 
165 



He Lys Ala Ala Asp Tyr 
55 

Thr He Leu Gly Asp Thr 
70 75 
Val Asn Ala Asp Val Leu 
90 

Glu Lys He Ala Arg Asn 
105 

Gin Val Ala Val Glu Ala 
120 

Ser Ala Arg Phe Leu Ser 
135 

Ala Thr He Phe Glu Asn 
150 155 
Glu He Tyr Gin Ser Gin 
170 
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He Ser Ala Pro Arg 
60 

Arg Lys Thr Asp Ala 
80 

Val His Glu Ser Thr 
95 

His Gly His Ser Thr 
110 

Gly Ala Lys Arg Leu 
125 

Lys Asp He Ser Lys 
140 

Val His Val Val Lys 
160 

Lys Gly 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 



Met He Gin Pro Ala Ser Leu Glu Glu Leu Ala Ser Leu Val Glu Lys 

15 10 15 

Ala Gly Lys Lys Val Phe Leu Phe Val Ala Asp Trp Cys Gly Asp Cys 

20 25 30 

Arg Tyr He Tyr Pro Ala Leu Pro Glu He Glu Glu Thr Asn Pro Glu 

35 40 45 

Phe Thr Phe He Arg Met Asp Arg Asp Gin Tyr Met Asp Leu Ala Lys 

50 55 60 

Leu Trp Asp Val Tyr Gly He Pro Ser Leu Val Val Leu Glu Lys Asp 
65 70 75 80 

Lys Glu He Gly Arg Phe Val Asn Arg Asp Arg Lys Ser Lys Glu Gin 

85 90 95 

He Asn Asp Phe Leu Ala Gly Leu Lys 
100 105 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Arg Leu Gly Leu Leu Gin Val Leu Arg Leu Pro Lys Ala Phe Gin 

15 10 15 

Leu lie Phe lie Gin Asp Lys Gly His Gly Asp Val Ser Ser Phe Thr 

20 25 30 

Ala Ser Cys Val Thr Gly Pro Trp Ala Phe Lys Leu Val Gly Lys Gin 

35 40 45 

Gly Asn lie His 
50 

(2) INFORMATION FOR SEQ ID NO: 3 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met Lys Thr Arg Lys lie Pro Leu Arg Lys Ser Val Val Ser Asn Glu 

15 10 15 

Val lie Asp Lys Arg Asp Leu Leu Arg lie Val Lys Asn Lys Glu Gly 

20 25 30 

Gin Val Phe lie Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 

35 40 45 

lie Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 

50 55 60 

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 
65 70 75 80 
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Leu He Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu 
85 90 
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Leu Gly Leu 
95 



Glu 



(2) INFORMATION FOR SEQ ID NO: 358: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

Met Arg Val Ala Tyr Phe Lys Val His His Pro He Tyr Tyr Tyr Cys 

15 10 15 

Ala Tyr Phe Ser He Arg Ala Lys Ala Phe Asp He Lys Thr Met Gly 

20 25 30 

Ala Gly Leu Glu Val He Lys Arg Arg Met Glu Glu He Ser G3 * Lys 

35 40 45 

Arg Lys Asn Asn Glu Ala Ser Asn Val Glu He Asp Leu Tyr Thr Thr 

50 55 60 

Leu Glu He Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys 
65 70 75 80 

Leu Asp Leu Tyr Cys Ser Gin Thr Thr Glu Phe Leu He Asp Gly Asp 

85 90 95 

Thr Leu He Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu Asn Val 

100 105 HO 

Ala Lys Gin Leu Val Arg Ala Arg Glu Glu Gly Glu Phe Leu Ser Lys 

115 120 125 

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys 

130 135 140 

Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu He Thr Ser 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

Met Ser Ser Lys He Ser He Gly Gin Leu He Thr Phe Asn Thr Leu 

15 10 15 

Leu Ser Tyr Phe Thr Thr Pro Met Glu Asn He He Asn Leu Gin Thr 

20 25 30 

Lys Leu Gin Ser Ala Lys Val Ala Asn Asn Arg Leu Asn Glu Val Tyr 

35 40 45 

Leu Val Glu Ser Glu Phe Gin Val Gin Glu Asn Pro Val His Ser His 

50 55 60 

Phe Leu Met Gly Asp He Glu Phe Asp Asp Leu Ser Tyr Lys Tyr Gly 
65 70 75 80 

Phe Gly Arg Asp Thr Leu Thr Asp He Asn Leu Thr He Lys Gin Gly 

85 90 95 

Asp Lys Val Ser Leu Val Gly Val Ser Gly Ser Gly Lys Thr Thr Leu 

100 105 110 

Ala Lys Met He Val Asn Phe Phe Glu Pro Tyr Lys Gly His He Ser 

115 120 125 

He Asn His Gin Asp He Lys Asn He Asp Lys Lys Ser Leu Ala Pro 

130 135 140 

Ser Tyr 
145 

(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

Met Arg Gin Ala Pro Ala Leu Ala Gin Ala Asn He Glu Arg Val Val 

15 10 15 

Val His Lys He Ser Lys Val Trp Glu Phe His Phe Arg He Phe 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 361 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Gin Thr Gly Trp Gin Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg 

15 10 15 

Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Glu Gly Thr Thr Trp 

20 25 30 

Tyr Tyr Leu Asp Gin Pro Asn Gly Asp Met Lys Thr Gly Trp Gin Asn 

35 40 45 

Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr 

50 55 60 

Gly Trp Tyr Gin Glu Gly Thr Thr Trp Tyr Tyr Leu Asp Gin Pro Asn 
65 70 75 80 

Gly Asp Met Lys Thr Gly Trp Gin Asn Leu Gly Asn Lys Trp Tyr Tyr 

85 90 95 

Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Asp Gly Ser 

100 105 110 

Thr Trp Tyr Tyr Leu Asn Ala Val Met Glu lie 
115 120 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 62: 

Met Lys Trp Gly Ser Arg He Leu Leu Gly Leu Thr Pro Lys Ser Leu 
15 10 15 
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Arg Tyr Arg lie Trp Lys Lys Ala Glu Lys Glu Met Thr Lys Tyr Asp 

20 25 30 

Leu Ala Asp Cys Asp Gly lie Thr Glu Leu Cys Ser Gly Pro Gly Tyr 

35 40 45 

Met Arg Asn Lys Tyr Pro He Thr Ser Phe Glu Asp Asn Leu Phe Leu 

50 55 60 

Pro Phe Glu Gly Thr Glu Met Pro He Pro He Gly Tyr Asp Val Tyr 
65 70 75 80 

Leu Arg Thr Ala Phe Gly Asp Tyr Met Thr Pro Pro Pro Ala Asp Lys 

85 90 95 

Gin Val Pro His Gin Asp Ala Val He Ala Asp Met Asp Lys Ser Tyr 

100 105 110 

Thr Glu Tyr Lys Gly Glu Tyr Gly Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

Met Phe Pro Asp Ser He Asp Thr Met His Gin Ala Asn Glu Phe He 

15 10 15 

Ala Leu Asp Asp Leu Phe Arg Ala Ala Ala He Tyr Ala Glu Ala He 

20 25 30 

Tyr Glu Leu He Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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Met Asp Glu Leu Gin 

1 5 
Phe Gin Leu Phe Pro 
20 

Pro Val Lys Thr Met 
35 

Ser Gly Leu Leu Glu 
50 

Pro Phe Ser Leu Ser 
65 

Ala Met Met He Asp 
85 

Ala Leu Asp Pro Glu 
100 

Asn Arg Glu Leu Gly 

115 120 

Phe Gly 
130 



Gly Phe Val Phe Gin Asp 
15 

Glu Asn Leu Thr Leu Ser 
30 

Glu Ala Glu Lys Lys Ala 
45 

Gly His Ala Glu Ser Tyr 
60 

Arg Val Ala Leu Ala Arg 
75 80 
Tyr Asp Glu Pro Thr Ser 
95 

Glu Lys Leu He Leu Gin 
110 

Val Thr His Asp Leu Gin 
125 



Lys Arg Asn Leu Leu 
10 

His Leu Ser Val Leu 
25 

Gly Met Lys Gin Glu 
40 

Gin Leu Gly Leu Gly 
55 

Gly Gly Gin Lys Gin 
70 

Pro Glu He He Gly 
90 

Leu Arg Leu Glu Val 
105 

Met Thr His He Val 



(2) INFORMATION FOR SEQ tD NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:365: 

Met Tyr Gly Ala Asn Ser Glu Arg Leu Lys Glu Ser Leu Arg He Ser 

1 5 10 15 

Leu Ser Pro Gin Asn Thr Val Glu Asp Leu Gin Thr Leu Ala Lys Thr 

20 25 30 

Leu Lys Glu He He Gly Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

Met lie Ala Arg Ser Tyr Gin Glu Met Asp Asp Leu Asp Thr Ala Tyr 

1 5 10 15 

Glu His Tyr Gin Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe 

20 25 30 

Leu Glu His Tyr lie Tyr Leu Leu Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 367 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 : 

Met Lys His Phe Asp Thr lie Val He Gly Gly Gly Pro Ala Gly Met 

15 10 15 

Met Ala Thr He Ser Ser Ser Phe Tyr Gly Gin Lys Thr Leu Leu He 

20 25 30 

Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly 

35 40 45 

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly 

50 55 60 

He Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gin Phe Asp 
65 70 75 80 

Asn His Asp He He Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys 

85 90 95 

Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr 

100 105 110 

lie He Glu Ala Leu Glu Lys Lys He Thr Glu Leu Gly Gly Gin Val 
115 120 125 
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Ala Thr Pro Asn Arg Asn Arg Phe Cys 
130 135 
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(2) INFORMATION FOR SEQ ID NO: 368: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



Met Thr Leu Ala Lys Asp lie Ala 

1 5 

Val Tyr Leu Lys Pro Glu Glu Pro 
20 

Ser Pro lie Tyr Thr Asp Asn Arg 

35 40 

Arg Thr Leu lie Gl 1 Asn Gly Phe 

50 55 

Pro Glu Val Glu Val He Ala Gly 

65 70 

Gly Ala He He Ala Asp Lys Met 
85 



Ser His Leu Leu Lys He Gin Ala 

10 15 

Phe Thr Trp Ala Ser Gly He Lys 
25 30 

Val Thr Leu Ala Tyr Pro Glu Thr 
45 

Val Glu Ala He Lys Glu Ala Phe 
60 

Thr Ala Thr Ala Gly He Pro His 

75 80 

Asp Leu Pro Phe Ala Tyr He Pro 
90 95 



(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:369: 



Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met 

15 10 15 

Val He Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gin Thr 
20 25 30 
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Asp Phe Thr Val Asn Ser Cys Ser 

35 40 
Val Gly Tyr Asp lie Val lie Ala 

50 55 
Glu Gly Arg Thr Asn Gly Lys Leu 
65 70 
Asp Lys Glu lie Thr Glu Lys Leu 
85 
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Val Gly Glu Ala Lys Gly Leu Ala 
45 

Ser Leu His Leu lie Gin Glu Leu 
60 

He Gly Leu Asp Asn Leu Met Asp 

75 80 
Ser Gin Ala He Gin 
90 



(2) INFORMATION FOR SEQ ID NO: 370: 



<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



Met Ser Met 
1 

Gly Gly Asp 

Gly Gly Pro 
35 

Val Val Asp 
50 

His Phe Lys 
65 

Gly Arg Gly 

Val Arg Asp 

Gly Gin Glu 
115 

Tyr Ser Phe 
130 



Phe Leu Asp Thr 

5 

Gly Met Val Ala 
20 

Trp Gly Gly Asp 

Glu Gly Leu Arg 
55 

Ala Asp Ser Gly 
70 

Ala Glu Asp Leu 
85 

Ala Glu Thr Gly 
100 

Phe He Val Ala 

Arg Asp Thr Lys 
135 



Ala Lys He 
10 

Phe Arg Arg 
25 

Gly Gly Arg 
40 

Thr Leu Met 

Glu Lys Gly 

Arg Val Arg 
90 

Lys Val Leu 
105 

His Gly Gly 
120 

Lys Ser Cys 



Lys Val Lys 

Glu Lys Tyr 

Gly Gly Asn 
45 

Asp Phe Arg 
60 

Met Thr Lys 
75 

Val Pro Gin 

Thr Asp Leu 

Pro Trp Trp 
125 

Thr Gly Asn 
140 



Ala Gly Asn 
15 

Val Pro Asn 
30 

Val Val Phe 

Tyr Asn Arg 

Gly Met His 
80 

Gly Thr Thr 
95 

He Glu His 
110 

Thr Trp Lys 
Leu 



(2) INFORMATION FOR SEQ ID NO:371: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 210 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

Met Met Lys Gly Glu Met Thr Phe Lys Gin Val His Tyr Lys Tyr Gly 

15 10 15 

Tyr Gly Arg Asp Val Leu Ser Asp He Asn Leu Thr Val Pro Gin Gly 

20 25 30 

Ser Lys Val Ala Phe Val Gly He Ser Gly Ser Gly Lys Thr Thr Leu 

35 40 45 

Ala Lys Met Met Val Asn Phe Tyr Asp Pro Ser Gin Gly Glu He Ser 

50 55 60 

Leu Gly Gly Val Asn Leu Asn Gin He Asp Lys Lys Ala Leu Arg Gin 
65 70 75 80 

Tyr He He Tyr Leu Pro Pro Gin Pro Tyr Val Phe Asn Gly Thr He 

85 90 95 

Leu Glu Asn Leu Leu Leu Gly Ala Lys Gly Gly Asp Asp Thr Gly Lys 

100 105 110 

He Sea. Leu Thr Gly Arg Ser Glu Phe Gly Ser Glu He Pro Lys Arg 

115 120 125 

Asn He Ser Lys Pro Pro Cys His Leu Glu He Tyr Gin Thr Glu Leu 

130 135 140 

Thr Ser Asp Gly Ala Gly lie Ser Gly Gly Gin Arg Gin Arg He Ala 
145 150 155 160 

Leu Ala Arg Ala Leu Leu Thr Asp Ala Pro Val Leu He Leu Asp Glu 

165 170 175 

Ala Thr Asn Ser Leu Asp He Leu Thr Lys Lys Arg He Val His Tyr 

180 185 190 

Leu Met Ala Leu Asp Lys Thr Phe Asp Phe His Cys Ser Pro Leu Asp 
195 200 205 

Tyr Cys 
210 

(2) INFORMATION FOR SEQ ID NO: 372: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

Met Ala Arg Phe He Glu He Asp Ala Lys Glu His Asp Arg Val Thr 

1 5 10 15 

Ser Gin He Ser His Phe Pro His He Leu Ala Ser Ser Leu Met Glu 

20 25 30 

Gin Thr Ala Val Tyr Ala Gin Glu His Glu Asn Gly Lys Ala Leu Cys 

35 40 45 

Gly Arg Trp Phe Ser Arg Tyr Asp Pro Asn Cys Gly Lys Arg Ala Arg 

50 55 60 

Asn Val Asp Leu His Ser Leu Val Gin 
65 70 

(2) INFORMATION FOR SEQ ID NO: 3 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

Met Ala His Gin Gly Gin Asp He Tyr Asp Phe Pro Arg Ala Met He 

15 10 15 

Lys Glu Asp Asn Leu Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe 

20 25 30 

He Asn Leu His His Asn Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr 

35 40 45 

Glu Asp Leu Cys Ala Ser Phe Gin Ala Ala Val Leu Asp He Leu Met 

50 55 60 

Ala Lys Thr Lys Lys Ala Leu Glu Lys Tyr Pro Val Lys Thr Leu Val 
65 70 75 80 

Val Ala Gly Gly Val Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala 

85 90 95 

Ala Glu Val Thr Asp Val Lys Val He He Pro Pro Leu Arg Leu Cys 

100 105 110 

Gly Asp Asn Ala Gly Met He Ala Tyr Ala Ser Val Ser Glu Trp Asn 
115 120 125 
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Lys Glu Asn Phe Ala Asn L u Asp Leu Asn Ala Lys Pro Ser Leu Ala 

130 135 140 

Phe Asp Thr Met Glu 
145 



{2} INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Met Lys Arg Ser Leu Asp Ser Arg 

1 5 

Val Phe Phe Leu Leu Val He Gly 
20 

Ser His Asp Tyr Pro Asn Asn He 

35 40 

Ala Trp He Ala Leu Gly Leu Val 

50 55 

Asn Thr Glu Phe Leu Trp Lys Val 
65 70 

Leu Gly Thr Tyr Asp Leu Ala Asp 
85 



Val Asp Tyr Ser Leu Leu Leu Pro 

10 15 
Val Val Ala He Tyr He Ala Val 
25 30 
Leu Pro lie Leu Gly Gin Gin Val 
45 

He Gly Phe Val Val Met Leu Phe 
60 

Thr Pro Phe Leu Tyr He Phe Arg 

75 80 
Cys lie Leu 
90 



(2) INFORMATION FOR SEQ ID NO: 375: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



Met Pro Asp Asp lie lie Asp Ser Phe Trp Tyr lie lie Asp His Phe 
15 10 15 
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Leu Lys Asn Val Phe Glu Leu Glu Glu Glu Leu Glu Phe Gin Leu Leu 

20 25 30 

Asn Asn Gin Gly Lys lie Thr Phe His Phe Ser Ser Gin His Leu Pro 

35 40 45 

Thr Ala lie Asp Phe Asp Phe Asn His Pro Phe Asp Pro Arg Tyr Pro 

50 55 60 

Pro Arg Val Leu Val Leu Asp Met Asp Gly Arg Glu Thr lie Leu Leu 
65 70 75 80 

Pro Glu Glu Asn Asp Leu Phe 
85 

(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi> SEQUENCE DESCRIPTION: SEQ IT NO: 376: 

Met Gly Lys Lys Asp Ala Ser Ala Met His Glu Met Arg Ala Ser Phe 

15 10 15 

He Gin Gly Ser He Glu Ala Gly His Thr Ala Glu Lys Ser Glu Gin 

20 25 30 

Val Phe Asp Val Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser 

35 40 45 

His Ala Tyr Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys 

50 55 60 

Thr His Tyr Pro Ala He Phe Tyr Gin Val Met Leu Asn Tyr Ser Asn 
65 70 75 80 

Ser Asp Tyr Leu He Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Ser 

85 90 95 

Leu Ser He Asn Thr He Pro Tyr His Asp Lys He Ala Asn Lys Ser 

100 105 no 

lie Tyr lie Gly Leu Lys Ser He Lys Gly Leu Gin Gin Gly Leu Gly 

115 120 125 

Ala Leu Asp Tyr 
130 

(2) INFORMATION FOR SEQ ID NO: 377: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met Asp Lys Lys Gin Asn Leu Thr 

1 5 
Glu Leu Asn Gin lie Thr Gly Gly 
20 

Glu Thr lie Leu Ser Lys Phe Lys 

35 40 
lie Gin Leu 
50 



Ser Phe Gin Glu Leu Thr Thr Thr 

10 15 
Glu Trp Trp Glu Glu Leu Leu His 
25 30 
lie Thr Lys Ala Leu Glu Leu Pro 
45 



(2) INFORMATION FOR SEQ ID NO: 378: 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



Met His Glu Met Phe Thr Ala He 

1 5 
Ala Ala He Gin Arg Lys Phe He 
20 

Tyr Pro Ser Pro Ala Pro Val Lys 

35 40 
Glu Ala Gly Pro Thr Arg Leu Pro 

50 55 
Val Lys Arg He He Lys Val Val 
65 70 
Lys Ala Thr Val Thr Gly Val Leu 
85 



Ala Glu Ser Asp Met Lys Lys Ala 

10 15 
Pro Lys Val Asn Ala Leu Phe Ser 
25 30 
Ala He Leu Asn Tyr Met Gly Phe 
45 

Leu Val Pro Ala Pro Glu Glu Asp 
60 

Val Asp Gly Asp Tyr Glu Ala Thr 

75 80 
Arg Pro Asp Tyr 
90 
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(2) INFORMATION FOR SEQ ID NO: 379: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

Met Leu Asn Glu Phe Pro lie Phe Asp Tyr Glu Asp He Gin Leu He 

15 10 15 

Pro Asn Lys Cys Val He Lys Ser Arg Ala Glu Ala Asp Thr Ser Val 

20 25 30 

Thr Leu Gly Asn His Thr Phe Lys Leu Pro Val Val Pro Ala Asn Met 

35 40 45 

Gin Thr He Leu Asp Glu Asn Val Ala Glu Gin Leu Ala Lys Gly Gly 

50 55 60 

Tyr Leu Tyr Thr Tyr Ala Pro Phe 
65 70 

(2) INFORMATION FOR SEQ ID NO:380: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

Met Asp He Lys Leu Lys Asp Phe Glu Gly Pro Leu Asp Leu Leu Leu 

15 10 15 

His Leu Val Ser Lys Tyr Gin Met Asp He Tyr Asp Val Pro He Thr 

20 25 30 

Glu Val He Glu Gin Tyr Leu Ala Tyr Val Ser Thr Leu Gin Ala Met 

35 40 45 

Arg Leu Glu Val Thr Gly Glu Tyr Met Val Met Ala Ser Gin Leu Met 
50 55 60 
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Leu lie Lys Ser Arg 
65 

Leu Gly Asp Asp Leu 
85 

Arg Lys Phe Lys Leu 
100 

Thr Gly Pro Val Leu 
115 

Cys Gly Ala Cys Ala 
130 



Lys Leu Leu Pro Lys Val 
70 75 
Glu Gin Asp Leu Leu Ser 
90 

Leu Gly Glu His Leu Glu 
105 

Phe Gin Ser Ala Asp Arg 
120 
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Ala Glu Val Thr Asp 
80 

Gin lie Glu Glu Tyr 
95 

Ala Lys His Gin Glu 
110 

Val Asp Leu Arg Arg 
125 



(2) INFORMATION FOR SEQ ID NO: 381: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:381: 



Met Gin Ser Thr Glu Lys Lys Pro Leu Thr Ala Phe Thr Xaa lie Ser 

15 10 15 

Thr lie lie Leu Leu Leu Leu Thr Val Leu Xaa lie Phe Pro Phe Tyr 

20 25 30 

Trp lie Leu Thr Gly Ala Phe Lys Ser Gin Pro Asp Thr He Val He 

35 40 45 

Pro Pro Gin Trp Phe Pro Lys Met Pro Thr Met Glu Asn Phe Gin Gin 

50 55 ■ 60 

Leu Met Val Gin Asn Pro Ala Leu Gin Trp Met Trp Asn Ser Val Phe 
65 70 75 80 

He Ser Leu Val Thr Met Phe Leu Val Cys Ala Thr Ser Ser Leu Ala 

85 90 95 

Gly Tyr Val Leu Ala Xaa Lys Arg Phe Tyr Gly Xaa Arg He Tyr Cys 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 3 82: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

Met Leu Phe Met Arg Asp Ser Leu Asp Ser lie Val Glu Pro Val Leu 

15 10 15 

Asp Glu Met Gly Arg Phe Tyr Asp Trp Thr Glu Glu Glu Lys Ala Thr 

20 25 30 

Tyr Arg Ala Asp Val Lys Ala Ala Leu Ala Gin Asn Asp Leu Ala Glu 

35 40 45 

Leu Lys Asn 
50 

(2) INFORMATION FOR SEQ ID NO: 383 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOP' /LOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

Met Lys lie Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu 

15 10 15 

Lys Asp Gly lie Ala Glu Tyr Ser Lys Arg lie Ser Arg Phe Ala Lys 

20 25 30 

Phe Glu Met He Glu Leu Ser Asp Glu Lys Thr Pro Asp Lys Ala Ser 

35 40 45 

Glu Ser Glu Asn Gin Lys He Leu Glu He Glu Gly Gin Arg He Leu 

50 55 60 

Ser Lys lie Ala Asp Arg Asp Phe Val He Val Leu Ala lie Glu Gly 
65 70 75 80 

Lys Thr Phe Phe Ser Glu Glu Phe Ser Lys Gin Leu Glu Glu Thr Ser 

85 90 95 

lie Lys Gly Xaa Ser Thr Leu Thr Phe lie lie Gly Gly Ser Leu Gly 

100 105 HO 

Leu Ser Ser Ser Val Lys Asn Arg Ala Asn Leu Ser Val Ser Phe Gly 
115 120 125 
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Arg Leu Thr Leu Pro His Gin Leu Met Arg Leu Val Leu Val Glu Gin 

130 135 140 

lie Tyr Arg Ala Phe Thr lie Gin Gin Gly Phe Pro Tyr His Lys 

145 150 155 

(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Met Gly lie Leu Ala Trp Cys Asp Leu Val Leu Tyr Arg Leu Trp Asp 

15 10 15 

Ser Phe Met Asp Leu Gly Leu Phe lie Asn Asp Ala Trp Val Arg Lys 

20 25 30 

Lys Lys Thr Leu Asn Lys Glu Arg Lys Lys Ala Gly Lys Ala Ala L' a 

35 40 45 

Pro Glu Asn Arg Trp He Gin Leu Leu Gly Met Val Val Thr Phe His 

50 55 60 

Val Val Met Leu Ser Phe Leu He Phe Ser Gly Phe Leu Asn Asn Leu 
65 70 75 80 

Trp Phe Lys Lys 



(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

Met Val Met Ala Glu Asp Gin Ala Val Arg Gin Asn Arg Leu Ala He 
15 10 15 
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Leu Ser Gin Leu Thr Lys Lys Ala Ala Lys Phe Ala Cys Phe Asn Gin 

20 25 30 

lie Asn Thr Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:386: 

Gly Ala Gin Pro Val Gin Asp Thr Glu Thr Ser Ser Ala Leu He 

5 10 15 

Ser His Tyr Leu Asp Glu Gin Asp Leu Ser Glu Lys Leu Lys Ser 

20 25 30 

Leu Gin Trp Phe Glu Leu Glu Asn Lys Leu Leu Asn Leu Trp Glu 
35 40 45 



Met 

i 

Ser 
G" u 
His 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Met Asp Glu Val Thr He Phe Gly He Asn Tyr Phe Lys Glu His Tyr 

1 5 io 15 

Pro Glu Lys Leu Ala Glu Arg Phe Lys Gin Met Lys He Glu Glu Glu 

20 25 30 

Ala Pro Val He He Met Asp Met Thr Arg Ala Leu Gly Phe Arg Asp 
35 40 45 
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Asp Tyr Asp Arg Phe Tyr Ser Leu Phe Arg Glu Gly Ser Pro 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Lys Val lie Asn Gin Thr Leu Leu Glu Lys Val lie lie Glu Arg 

15 10 15 

Ser Arg Ser Ser His Lys Gly Asp Tyr Gly Xaa Leu Leu Leu Leu Gly 

20 25 30 

Gly Thr Tyr Pro Tyr Gly Val Xaa Ser Ser Trp Leu Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389: 

Met Asn Trp lie Leu Leu Gin Gly Leu lie Cys Gin Arg Asp Ala Ser 

1 5 10 15 

Tyr Asp Met Lys Gin Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe 

20 25 30 

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu lie Leu Phe Gly Asn Leu 

35 40 45 

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg lie Gly Arg Glu Val 

50 55 60 

Met Glu Arg Glu Glu Phe Tyr Gin Glu lie Ser Arg His Lys Arg Leu 
65 70 75 80 
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Val Leu lie Leu Ala Leu Asn Cys Tyr Gin His Cys Leu Glu His Ser 

85 90 95 

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys lie He 

100 105 no 

Asp Lys Gly He Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys 

115 120 125 

Gly Phe Ala Leu Tyr Gin Lys Gly Gin Cys Lys Glu Gly Cys Lys Gin 

130 135 140 

Met Gin Glu Thr Met His He Phe Asp Val Leu Gly Leu Pro Glu Gin 
145 "0 155 160 

Val Ser Leu Leu Ser Gly Thr Leu Arg Lys He Cys Gin Lys Leu He 

165 170 175 

Phe Pro Asn Lys Gly Lys Asn Lys Lys Leu Leu Ser Val Leu He Gin 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 390: 

Met Ser Thr Val Leu Ser Trp Thr Ala Tyr Lys Thr Gin Arg Leu Glu 

1 5 10 15 

Met Ser He Leu Leu His Met He Val Asn Gly He Ala Phe Cys Leu 

20 25 30 

Leu Ala Leu Val Val He Met Ser Arg Thr Leu Gly He Ser Val 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 3 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: 
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SEQ ID NO: 391: 



Met Arg Glu Asn Asp Leu Leu Leu lie Thr Ala Asp His Gly Asn Asp 

1 5 10 15 

Pro Thr Tyr Ala Gly Thr Asp His Thr Arg Glu Tyr lie Pro Leu Leu 

20 25 30 

Ala Tyr Ser Pro Ala Phe Lys Gly Asn Gly Leu He Pro Val Gly His 

35 40 45 

Phe Ala Asp He Ser Ala Thr Val Ala Asp Asn Phe Gly Val Glu Thr 

50 55 60 

Ala Met He Gly Glu Ser Phe Leu Asp Lys Leu Val 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

Met Lys Lys He Phe Leu Thr Leu Leu Thr Val Ser Leu Leu Gly Gly 

15 10 15 

Ala Ser Thr Ala Val Ala Gin Asp Phe Thr He Ala Ala Lys His Ala 

20 25 30 

He Ala Val Glu Ala Asn Thr Gly Lys He Leu Tyr Glu Lys Asp Ala 

35 40 45 

Thr Pro Pro Val Glu He Ala Ser He Thr Lys Leu He Thr Val Tyr 

50 55 60 

Leu Val Tyr Glu Ala Leu Glu Asn Gly Ser He Thr Leu Ser Thr Pro 
65 70 75 . 80 

Val Asp He Ser Asp Tyr Pro Tyr Gin Leu Thr Thr Asn Ser Glu Ala 

85 90 95 

Ser Asn He Pro Met Glu Ala Arg Asn Tyr Thr Val Glu Glu Leu Leu 

100 105 HO 

Glu Ala Thr Leu Val Ser Ser Ala Asn Ser Ala Ala He Ala Leu Ala 

115 120 125 

Glu Lys lie Ala Gly Ser Glu Lys Asp Phe Val Xaa Met Met Arg Ala 
130 135 140 



296 



WO 97/37026 PCT/US97/05306 

Lys Leu Leu Glu Trp Gly He Arg He His Cys Cys Gin Tyr Asp Arg 
145 150 155 160 

Ser 



(2) INFORMATION FOR SEQ ID NO: 3 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3 93: 

Met Ser Asn Ala Met Glu Gly Val Leu Tyr Phe Leu Lys Pro Asp Phe 

15 10 15 

Ser Lys Leu Thr Ser Ala Asp Leu Leu Tyr Ala Leu Gly Gin Ser Phe 

20 25 30 

Phe Ala Leu Ser Leu Gly Val Thr Asp Met Leu Thr Tyr Ala Ser Tyr 

35 40 45 

Leu Asp Lys Lys Thr Asn Leu Val Gin Ser Gly He Ser He Val Thr 

50 55 60 

Met Asn He Ser He Val His His Gly Arg Ser Ser His Phe Pro Ser 
65 70 75 80 

His Val Ser Leu Gin Tyr Pro Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 3 94: 

(i) SEQUENCE CHARACTERISTICS: 
{A} LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94 : 

Met Lys Val Ala Lys Pro Phe Trp Ala He Val Lys Leu Lys Ser Val 
15 10 15 
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Gin His Ala Ser Met Leu Glu Asn 
20 

Lys Gin Val Glu Asn Leu Ala Leu 

35 40 
Lys Glu Asn Lys Ala Phe Glu Gin 

50 55 
Gin Thr Phe Leu Thr lie Phe Leu 
65 70 
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Pro Lys Glu Met Asp Gly Leu Met 
25 30 
Glu Asn Gin Gly Tyr Gin Val Glu 
45 

lie Lys Asp Ser Val Ala Thr Phe 
60 

Tyr Gly Cys 
75 



(2) INFORMATION FOR SEQ ID NO: 395: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 3 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



Met Asp Ala Glu Val Ser lys Asn 

1 5 
Gly Met Thr He Leu Thr Gly Thr 
20 

Asn Gly Gin Leu Arg He Lys Val 

35 40 
Ser Lys Ala Leu Leu Ser He Gly 

50 55 
Gly Glu Val Glu Phe Glu Leu Asp 
65 70 
Tyr Met Glu Thr Ser Val Pro Arg 
85 



Leu Arg Leu He Leu Glu Arg Lys 

10 15 
Lys Leu Gin Glu He lie Glu Glu 
25 30 
Glu Gly Lys Asp Asn He He Ala 
45 

Arg Met Pro Asp Leu Glu Gly He 
60 

Arg Gly Cys He Lys Val Asn Glu 

75 80 
He Tyr Ala Thr Arg 
90 



(2) INFORMATION FOR SEQ ID NO: 396: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 
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Met Lys Asn Ser He Met Asp Thr Lys Phe Asp Arg Arg He Leu Leu 

15 10 15 

Leu Asn Lys He He He Val Phe He Val Leu Met Thr Leu Leu Pro 

20 25 30 

Leu Leu Tyr He Val Val Ala Ser Phe Met Asp Pro Lys Val Leu Val 

35 40 45 

Ser Arg Gly He Ser Phe Asn Pro Ala Asp Trp Thr Val Glu Gly Tyr 

50 55 60 

Gin Arg Val He Gin 
65 

(2) INFORMATION FOR SEQ ID NO: 3 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

Met Val He Pro Glu Ala Gly Tyr Leu Tyr Ala Phe His Tyr Pro Asn 

15 10 15 

Leu Lys Gly Lys Gly Lys Glu Ala Val Gin Val He Tyr Asn Leu Asn 

20 25 30 

Leu Ala Ser Ala Lys Val He Gin Leu Tyr Arg Ser Leu Gly Leu Asp 

35 40 45 

Gly Lys He Gly He He Leu Asn Leu Thr Pro Ala Tyr Pro Arg Ser 

50 55 60 

Asn Ser Pro Glu Asp Leu Glu Ala Ser Arg Phe Thr Asp Asp Phe Phe 
65 70 75 80 

Asn Lys Val Phe Leu Glu Ser Ser Cys 
85 

(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 



Met Thr Arg lie Ala Ser Val Ser 

1 5 
Ala lie Ala Leu Ser Phe Leu Gly 
20 

lie Pro Asn Ala Val Leu Gly Gly 

35 40 
He Ala Ser Asn Gly Leu Lys Val 

50 55 
Ala Gin Met Arg Asn Leu He He 
65 70 
Leu Gly Arg Ser Tyr Pro 
85 



Val He Arg Asn Ala Ala Phe He 

10 15 
Lys Phe Thr Ala Leu He Ser Thr 
25 30 
Met Ser He Leu Leu Tyr Gly Val 
45 

Leu He Lys Glu Arg Val Asp Phe 
60 

Ala Ser Ala Met Leu Val Leu Gly 
75 80 



(2) INFORMATION FOR SEQ ID NO: 3 99: 

(i) SENTENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 



Met Ser Leu Arg Glu Lys Ser Met Ser Glu Tyr Lys Leu Ser Glu Asn 

15 10 15 

Asn Trp Thr Arg Val Ala Val Phe Ala Gly Gly Asn Arg Gly His Tyr 

20 25 30 

Arg Thr Asp Phe Asp Ala Phe Val Gly Val Asp Arg Gly Ser Leu Trp 

35 40 45 

Val Leu Glu Glu Asp Leu Pro Leu Ala Leu Ala Val Gly Asp Phe Asp 

50 55 60 

Ser Val Thr Glu Glu Glu Arg Gin Val He Gin Lys Arg Ala Gin Tyr 
65 70 75 80 

Phe Val Gin Ala Arg Pro Glu Lys Asp Asp Thr Asp Leu Glu Leu Ala 
85 90 95 
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Leu Leu Thr lie Phe Glu Gin Asn 
100 

Gly Ala Leu Gly Gly Arg He Asp 
115 120 

Thr 
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Pro Gin Ala Glu Val Thr He Phe 
105 110 
His Met Leu Ala Asn Val Leu Ser 
125 



(2) INFORMATION FOR SEQ ID NO: 4 00: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



Met Val Leu Gin Arg Asn Glu He Asn Glu Lys Asp Thr Trp Asp Leu 

15 10 15 

Ser Thr He Tyr Pro Thr Asp Gin Ala Trp Glu Glu Ala Leu Ly ; Asp 

20 25 30 

Leu Thr Glu Gin Leu Glu Thr Val Ala Gin Tyr Glu Gly His Leu Leu 

35 40 45 

Asp Ser Ala Asp Asn Leu Leu Val Asn His 
50 55 



(2) INFORMATION FOR SEQ ID NO: 401: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: -single 

(D) TOPOLOGY: linear* 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 



Met He Tyr Gly Ala He He Asp Ala Lys Thr Ala Glu Asn Ala Ala 

15 10 15 

Gly Met Thr Ala Met Gin Thr Ala Thr Asp Asn Ala Lys Lys Val He 

20 25 30 
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Asn Asp Leu Thr lie Gin Tyr Asn Arg Ala Gin Thr Gly Gly Tyr Tyr 

35 40 45 

Thr Arg Asn Tyr Arg Asn Arg Ser Ser Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Met Thr Asp Gly Tyr Glu His Tyr Lys Lys lie Met Glu Asn Pro Pro 

15 10 15 

Lys Asn Leu lie Phe Pro Gly lie Val Ser Pro Glu Arg Met Arg Glu 

20 25 30 

\ie Val Cys Ser Asn Gly Ser Phe Leu Val Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 

Met Ser Lys Gin Lys Lys Phe Glu Glu Asn Leu Ala Glu Leu Glu Thr 

15 10 15 

lie Val Gin Ser Leu Glu Asn Gly Glu lie Ala Leu Glu Asp Ala lie 

20 25 30 

Thr Ala Phe Gin Lys Gly Met Val Leu Ser Lys Glu Leu Gin Ala Thr 

35 40 45 

Leu Asp Lys Ala Glu Lys Thr Leu Val Lys Val Met Gin Glu Asp Gly 
50 55 60 
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Thr Glu Ser Asp Phe Glu 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Met Phe Ala Ala Val Thr Ala Gly Tyr Gin Ala Ala Leu Met Val Pro 

15 10 15 

Thr Glu He Leu Ala Glu Gin His Phe Glu Ser Leu Gin Asn Leu Phe 

20 25 30 

Pro Asn Leu Lys Leu Ala Leu Leu Thr Gly Ser Leu Lys Ala Ala Glu 

35 40 45 

Lys Arg Glu Val Leu Glu Thr He Ala Lys Gly G) i Ala Asp Leu He 

50 55 t. 

He Gly Thr His Ala Leu He Gin Asp Gly Val Glu Tyr Ala Arg Leu 
65 70 75 80 

Gly Leu He He He Asp Glu Gin His Arg Phe Gly Val Gly Gin Arg 

85 90 95 

Arg He Leu Arg Glu Lys Gly Asp Asn Pro Asp Val Leu Met Met Thr 

100 105 no 

Ala Thr Pro He Pro Arg Thr Leu Ala lie Thr Ala Phe Gly Asp Met 

115 120 125 

Asp Val Ser He He Asp Gin Met Pro Ala Gly Arg Lys Pro Tyr Cys 

130 135 140 

Asp Ala Leu Asp Gin Thr 
145 150 

(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405: 

Met Asn Arg Glu Ala Leu Arg Leu Tyr Leu Val Thr Asn Arg Tyr Gin 

X 5 10 15 

Asp Ser Val Glu Ser Phe Leu Ala Lys Val Glu Thr Ala Cys Arg Ser 

20 25 30 

Gly Val Thr He Val Gin Leu Arg Glu Lys Asn Leu Thr Thr Asn Gin 

35 40 45 

Tyr Tyr Gin Leu Ala Lys Gin Val Lys Glu He Thr Asp Ala Tyr Gin 

50 55 60 

Val Pro Leu He He Asp Asp Arg Leu Asp Val Cys Leu Ala Val Asp 
65 70 75 80 

Ala Ala Gly Leu His He Gly Asp Asp Glu Leu Pro Val Ser Val Ala 

85 90 95 

Arg Gin Val Leu Gly Pro Glu Lys He Pro Arg Cys His Arg 
100 105 HO 

(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Gin Gly Ser His Phe His Leu Pro He Tyr Arg Met Thr Ser Gin 

1 5 10 I 5 

Ala Leu Leu Asp Glu Thr Lys Lys Val Ala He Pro Val Leu Ala Thr 

20 '25 30 

Thr Gin Ser Lys Asp Ser Val Asp Tyr Arg Glu Leu Pro Ser He Glu 

35 40 45 

Asn Phe Val Leu Val Met Gly Asn Glu Gly Gin Gly He Ser Pro Leu 

50 55 60 

Met Ala Glu Ser Ala Asp Gin Leu Val His He Ser Met Lys Gly Gin 
65 70 75 80 

Ala Glu Ser Leu Asn Val Ala Val Ala Ala Gly He Leu He Phe His 
85 90 95 

Leu Ser 
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(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 

15 10 15 

Arg Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr 

20 25 30 

lie Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu lie 

35 40 45 

Asp Thr Glu Glu Asp Lys Leu Asn Gin Thr Arg Tyr Thr Gin Pro Ala 

50 55 60 

lie Leu Ala Thr Ser Val Ala lie fyr Arg Leu Leu Gin Glu Lys Gly 
65 70 75 80 

Tyr His Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 

85 90 95 

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 

100 105 110 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 

115 120 125 

Lys Met Val Ala Val Leu His Thr Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 408: 

£i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 
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Met Tyr Gin Phe Ser Gly Gin Thr Lys Val Xaa Glu Val Leu Ala Phe 

15 10 15 

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

Met Thr Lys Glu Thr Gly Met Glu Gin Thr Phe Phe lie lie Lys Pro 

15 10 15 

Asp Gly Val Lys Arg Gly Leu Val Gly Glu Val Leu Lys Arg lie Glu 

20 25 30 

Gin Arg Gly Phe Thr lie Glu Lys Leu Glu Phe Arg Ser Gin Val Phe 

35 40 45 

Arg Arg Val Asp 
50 

(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Met Thr Ser Gly Pro Val Leu Val Gly Val He Ser Gly Pro Lys Val 
15 10 15 
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lie Glu Thr Trp Arg Thr Met Met 
20 

Leu Pro Gly Thr lie Arg Gly Asp 

35 40 
Glu lie He Gin Asn Val Val His 

50 55 
Leu Ser Arg Glu He Ala Pro Leu 
65 70 
Gin Leu Asp Lys Ser Ser Phe Glu 
85 
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Gly Ala Thr Arg Pro Glu Glu Ala 
25 30 
Phe Ala Lys Ala Ala Gly Glu Asn 
45 

Gly Ser Asp Ser Glu Lys Asn Gin 
60 

Val Leu Arg Val Asp Trp Leu Asn 
75 80 



(2) INFORMATION FOR SEQ ID NO: 411: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE JESCRIPTION: SEQ ID NO: 411: 



Met Gly Leu Lys Asp Asn Ala Gly Leu Val Lys 

15 10 
Asp Ala Val Glu Tyr Gin Asn Val He Val Ala 

20 25 
Met Leu Ser Lys Tyr Ala Asn Lys Asn Asp Leu 

35 40 
Thr Thr Glu Ser He Lys Gly Val Val Ala Asp 

50 55 
lie Lys Gly Asp Thr Val Tyr Phe Phe Lys Val 
65 70 75 

Lys Val Lys Ala Ser Val Ser Asp Asp Leu Pro 

85 90 
Lys Thr Phe Glu Gly Gin Val Xaa Lys Asp Asn 
100 105 

Lys Val Gin 
115 



Glu Tyr Ala Leu Val 
15 

Thr Thr Val Glu Glu 
30 

Glu He Asp Asn Ala 
45 

Leu Lys Ser Ala Val 
60 

Asp Gly Lys He Tyr 
80 

Tyr Leu Glu Asn Gly 
95 

Tyr Leu Lys Thr Phe 
110 



(2) INFORMATION FOR SEQ ID NO: 412: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 amino acids 
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(B) TYPE: amino acid 

<C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 



Met He Ser Gly Ser Glu He Arg 

1 5 
Leu Ala Asp Lys Thr Glu Thr Leu 
20 

Gly Leu Asp Asn He Thr Val Ala 
35 40 

Glu Glu 
50 



Asp He Val Thr Ser Asp He Pro 

10 15 
Val Arg Phe Ala Asn Asn Ala Gly 
25 30 
Leu Val Ser Met Asn Glu Glu Asp 
45 



(2) INFORMATION FOR SEQ ID NO: 413: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 



Met Pro Asp Asn Leu Ala Leu Arg 

1 5 
Val He Gly Gin Glu Xaa Leu Val 
20 

Met Val Glu Ala Asn Arg Leu Ser 

35 40 
Gly lie Gly Lys Thr Ser He Ala 

50 55 
Tyr Ala Phe Arg Thr Phe Asn Ala 
65 70 



Met Arg Pro Lys Thr He Asp Gin 

10 15 
Gly Pro Gly Lys He He Arg Arg 
25 30 
Ser Met He Leu Tyr Gly Pro Pro 
45 

Ser Ala He Ala Gly Thr Thr Lys 
60 

Thr Val Asp Ser 
75 



(2) INFORMATION FOR SEQ ID NO: 414: 



{i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 33 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Pro Glu Glu Leu Ala Tyx Leu Val Gin His Leu Asp Asn Ala Gin 

15 10 15 

Glu Gin Val Val Leu Gly Asn Thr Tyr His Thr Gly Asn His Cys Phe 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Met Val Phe Trp Glu Gly Leu Val Pro Thr Pro Asp Val He Asp Ala 

1 5 io 15 

Gly His Val Thr Gly Phe Leu Tyr Thr Val His Ser Ser Ser Thr Gly 

20 25 30 

Leu He Val Leu Gin He Lys Lys Asp Leu Leu Lys Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Met Thr Asp Arg Tyr Asp Ser Ser Leu Gly lie Tyr Lys Val Pro Trp 

15 10 15 

Cys Val Ser Gin Gin Gin Thr Val Thr Glu He Met Asp Thr Tyr Cys 

20 25 30 

Asp Trp Gly Val Lys Tyr Pro Leu Val Tyr Leu Glu Asp Pro Phe Ser 

35 40 45 

Asp Glu Asp Leu Asp Ser Trp Arg Lys Phe Gin Leu He Lys Pro Leu 

50 55 60 

Lys Leu Gin Val Phe Gly Asp Asp Phe Tyr Ala Thr Asn Leu Glu Arg 
65 70 75 80 

He Ser Gin Phe Lys Asp Cys Ala Asp Gly He Val He Lys Pro Asn 

85 90 95 

Gin Val Gly Ser Val Ser Lys Thr Leu Glu Val Met Glu Tyr Ala Glu 

100 105 HO 

Lys Ser Gly He Ser Met Ala Phe Ser Gin Arg Thr Ala Glu Thr Glu 

115 120 125 

Asn Asn He He Ser His Leu Ala Met Ser Val He Leu Leu lie 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 417: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met Lys Lys He Ser Lys Gin Glu Leu Val Asn Thr Arg Phe Ser Arg 

1 5 10 15 

Leu Phe Ala His Phe Gly Gin Glu Lys Asp Gly Ser Phe Leu Ala Gin 

20 25 30 

Arg Tyr Gin Phe Tyr Leu Ala Gin Gin Gly Gin Thr Leu Ser Gly Ala 

35 40 45 

His Asp Leu Leu Asp Ser Leu He Glu Arg Asp Tyr Asn Leu Tyr Ala 

50 55 60 

Ala Thr Asn Gly He Thr Ala He Gin Thr Gly Arg Leu Ala Gin Ser 
65 70 75 80 
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Gly Leu Ala Pro Tyr Phe Asn Gin 
85 

Thr Gin Lys Pro Asp Ala Leu Phe 
100 

Ala Gly Phe Ser Lys Glu Lys Thr 
115 120 
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Val Phe He Ser Glu Gin Leu Gin 

90 95 
Tyr Glu Lys He Gly Gin Gin He 
105 110 



(2) INFORMATION FOR SEQ ID NO: 418: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met His Arg Lys Thr Val He Asp Xaa Arg Thr Leu Gly Glu Arg Tyr 

15 10 15 

Thr Phe Thr Xaa Pro He Lys Glu Leu Lys Thr Arg Asn Val Ala Glu 

20 25 30 

Val Ala Asp Leu Leu Ala Xaa Val Glu Ser Tyr Gin Glu Gin Asp Tyr 

35 40 45 

Tyr Val Val Gly Tyr Val Ser Tyr Glu Ala Ala Pro Ala Phe Glu Glu 

50 55 60 

Lys Leu Ala Val His Lys Val Pro Leu Leu Gly Glu Tyr Leu Leu Tyr 
65 70 75 80 

Phe Thr Val His Asp Arg Val Glu Thr Ser Pro He Pro Leu Thr Tyr 

85 90 95 

Glu Asp He Asp Leu Pro Ser Asn Trp Gin Gly Xaa Asn Val Cys Thr 
100 105 110 

Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



311 



WO 97/37026 



(ii) MOLECULE 



TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Met Glu Arg lie Glu Val Arg Ser Arg Ser Gly Ser Ala His Leu Gly 

15 10 15 

His Val Phe Thr Asp Gly Pro Arg Glu Leu Gly Gly Leu Arg Tyr Cys 

20 25 30 

lie Asn Ser Ala Ser Leu Arg Phe Val Ala Lys Asp Glu Met Glu Lys 

35 40 45 

Ala Gly Tyr Gly Tyr Leu Leu Pro Tyr Leu Asn Lys 
50 55 60 

{2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

Met Glu Trp He Arg Leu He Gly He Ala He He Val Val Gly Phe 

15 10 15 

He Leu Lys Phe Asp Thr He Ala Thr Val Val Leu Ala Gly Leu Val 

20 25 30 

Thr Ala Leu Val Ser Gly Val Ser Leu Val Glu Phe Leu Glu He Leu 

35 40 45 

Gly Lys Glu Phe Ser Asn Gin Arg Val Leu Thr He Phe Met Val Thr 

50 55 60 

Leu Pro Leu Val Gly Leu Ser Glu Thr Phe Gly Leu Asn Asn Asp Gin 
65 70 75 80 

Ser He 



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

Met Leu Leu lie Phe Ala Val lie Gly Val Leu Val Ala Leu lie Ala 

1 5 10 15 

Gin Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala Lys Glu Leu Thr 

20 25 30 

Asn Asp Leu Tyr Arg His He Leu Ser Leu Pro Lys Asp Ser Arg Asp 

35 40 45 

Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr Ser Asp Thr Tyr 

50 55 60 

Gin He Gin Thr Gly He Asn Gin Phe Leu Arg Leu Phe Leu Arg Ala 
65 70 75 80 

Pro He He Val Phe Gly Ala He Phe Met Ala Tyr Arg lie Ser Ala 

85 90 95 

Glu Leu Thr Phe Trp Phe Leu Val Met Val Ala He Leu Thr He Val 

100 105 HO 

He Val Arg Val He Ser He Gly Gin Ser Ser Leu Gin 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 422: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 



Met Pro He Glu Gin Val Gin Lys Leu Leu Gly His Ser Lys He Asp 

15 10 15 

Thr Thr Leu Ala Tyr Ala Met Val Asn Gin Asn Asn Val Lys His Ser 

20 25 30 

His Gin Lys Phe He Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 423: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 



Met Lys Pro Cys Tip Lys Lys Cys 

1 5 
Gly Glu Val Thr Leu He Glu He 
20 

Lys Gin Val His Glu Leu Asn Leu 

35 40 
Gin Val His Asn Gly Lys Ser Gin 

50 55 
Tyr Leu Gly Asp Met He His Leu 
65 70 
Lys Val Lys Asp Leu Leu Leu 
85 



Phe Pro Glu Glu Val Ser Ser Glu 

10 15 
Pro Val Ser Asp Lys He Ala Gly 
25 30 
Pro His Asn Val Leu He Thr Thr 
45 

Thr Val Asn Gly Ser Thr Arg Met 

60 

Val He Pro Lys Ser Glu He Gly 
75 80 



(2) INFORMATION FOR SEQ ID NO: 424: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:424: 

Met Asn Glu Gly Val Glu Asn Phe Arg Ala Lys Leu Ala Ser Leu Gly 

15 10 15 

Ala Lys Asn He Gly He Tyr Val Gly Val Tyr Phe Met Glu Glu His 

20 25 30 

Ser He Asp Thr Gly Lys Phe Thr Ser Val Trp He Pro Ser Tyr Gly 

35 40 45 

Ser Asp Ser Gly Phe Leu Glu Ser Ser Pro Lys Thr Asp Leu Asp Tyr 
50 55 60 
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Asp lie His Gin Tyr Thr Ser Lys Gly Lys Leu Pro Ala Leu Thr Thr 
65 70 75 80 

He 



(2) INFORMATION FOR SEQ ID NO: 425: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:425: 

Met Val Ser Lys Tyr Leu Leu Leu Ala Val He Phe Ser Gly Leu Val 

15 10 15 

Thr Trp He Leu Arg Met He Ser Phe He Leu Val Lys Tyr Lys Gly 

20 25 30 

Leu Pro Ala He Val Glu Arg Phe Leu Lys Phe Leu Pro Val Ser He 

35 40 45 

He Phe Ala Leu He Leu Ser Ser Val Val Thr Gly Lys Val Gly Ser 

50 55 60 

Leu Pro Gin He Lys Trp Leu Asp Phe Leu Ala Val Phe Pro Thr Ala 
65 70 75 80 

Trp Val Ala Phe Arg Tyr Arg Asn Leu Leu Gly Thr Val Leu Phe Gly 

85 90 95 

Val Val Leu He Ala He Leu Arg Leu Val Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 42 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:426: 
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Met Glu Asp Xaa Glu Thr Gin Arg Val Val Met Gin Tyr Arg Ala Pro 

15 10 15 

Glu Asn Asn Arg Trp Ser Gly Tyr Ala Phe Pro Gly Gly His Val Glu 

20 25 30 

Asn Asp Glu Ala Phe Ala Glu Ser Val lie Arg Glu lie Tyr Glu Glu 

35 40 45 

Thr Gly Leu Thr lie Gin Asn Pro Gin Leu Val Gly He Lys Asn Trp 

50 55 60 

Pro Leu Asp Thr Gly Gly Arg Tyr He Val He Cys Tyr Lys Ala Thr 
65 70 75 80 

Glu Phe Ser Gly Thr Leu Gin Ser Ser Glu Glu Gly Glu Val Ser Trp 

85 90 95 

Val Gin Lys Asp Gin He Pro Asn Leu Asn Leu Ala Tyr Asp Met Leu 

100 105 110 

Pro Leu Met Glu Met Met Glu Ala Pro Asp Lys Ser Glu Phe Phe Tyr 

115 120 125 

Pro Arg Arg Thr Glu Asp Asp Trp Glu Lys Lys He Phe 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 427: 



(i) SEQUENC CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



Met Thr He Glu Leu Leu Thr Pro 

1 5 
Glu He Lys Glu Lys Lys Arg Lys 
20 

Phe Asn Pro Val His Asn Ala His 

35 40 
Gin Gin Leu Gly Leu Asp Gin Val 

50 55 
Pro His Val 
65 



Phe Thr Lys Val Glu Leu Glu Pro 

10 15 
Gin Val Gly He Leu Gly Gly Asn 
25 30 
Leu He Val Ala Asp Gin Val Arg 
45 

Leu Leu Met Pro Glu Tyr Gin Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 428: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 



Met Lys Lys Leu Val Phe Val Cys Leu Gly Asn lie Cys Arg Ser Pro 

15 10 15 

Met Ala Glu Phe Val Met Lys Ser Met Thr Asp Asn Tyr Glu lie Gin 

20 25 30 

Ser Arg Ala Thr Ser Ser Trp Glu His Gly Asn Pro lie His Lys Gly 

35 40 45 

Thr Gin Gly lie Phe Gin Glu Tyr Glu lie Pro Tyr Asp Lys Asn Lys 

50 55 60 

Thr Ser Leu Gin lie Ser Lys Glu Asp Phe Glu Ala Phe Asp Tyr lie 
65 70 75 80 

He Gly Met Asp Ala Ser Asn Val Pro Thr Tyr Val Arg Cys Val Gin 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429: 

Met Val Lys Tyr Gly Val Val Gly Ala Gly Tyr Phe Gly Ala Glu Leu 

15 10 15 

Ala Arg Tyr Met Gin Lys Asn Asp Gly Ala Xaa He Thr Leu Leu Tyr 

20 25 30 

Asp Pro Asp Asn Ala Glu Ala He Ala Glu Glu Leu Gly Ala Lys Val 

35 40 45 

Ser Lys Phe Leu Arg 
50 
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(2) INFORMATION FOR SEQ ID NO: 430: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

Met Leu Trp Lys Tyr Tyr Asn Ser Ser Phe Leu Lys Ala Val Pro His 

15 10 15 

Leu Thr Thr Glu Tyr Lys Arg Leu Ala Gin Ala His Gly Leu Asn Leu 

20 25 30 

Lys Gin Ala Lys Pro lie Thr Met Gly Met Trp lie Gly Gly Asp Arg 

35 40 45 

Glu Gly Asn Pro Phe Val Thr Ala Lys Asn Leu Lys Gin Ser Ala Leu 

50 55 60 

Thr Gin Cys Glu Val Xaa Leu Asn Leu Leu 
65 - 70 

(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:431: 

Met Asn Thr Leu Tyr Leu Asn Gin His Ser Ser Tyr Lys Asn Asp Glu 

15 10 15 

Asp Ser His Ser Phe Pro lie Gin Met Glu Leu Val Ser Asp Glu Met 

20 25 30 

He Pro Arg Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 432: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 



Met Ala Arg Lys Glu Thr Met Ala 

1 5 
Lys Asp He His Pro Glu Tyr Arg 
20 

Thr Gly Tyr Gin Phe Leu Ser Gly 

35 40 
Val Glu Phe Glu Gly Glu Thr Tyr 

50 55 
Ser Asp Ser His Pro Phe Tyr Thr 
65 70 
Asp Gly Arg Val Asp Arg Phe Asn 
85 



Glu Arg Lys Tyr Cys Lys Met Lys 

10 15 
Pro Val Val Phe Met Asp Thr Thr 
25 30 
Ser Thr Lys Arg Ser Asn Glu Thr 
45 

Pro Leu He Arg Val Glu He Ser 
60 

Gly Arg Gin Lys Phe Thr Gin Ala 

75 80 
Lys Lys Tyr Gly Leu Lys 
90 



(2) INFORMATION FOR SEQ ID NO: 433: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

Met Glu Thr Ala Leu He Ser Val He Val Pro Val Tyr Asn Val Ala 

15 10 15 

Gin Tyr Leu Glu Lys Ser He Ala Ser He Gin Lys Gin Thr Tyr Gin 

20 25 30 

Asn Leu Glu He lie Leu Val Asp Asp Gly Ala Thr Asp Glu Ser Gly 

35 40 45 

Arg Leu Cys Asp Ser He Ala Glu Gin Asp Asp Arg Val Ser Val Leu 
50 55 60 
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His Lys Lys Asn Glu Gly Leu Ser Gin Ala Arg Asn Asp Gly Met Lys 
65 70 75 80 

Pro Gly Ser Arg Gly Leu Ser Asp Phe Tyr 
85 90 



(2) INFORMATION FOR SEQ ID NO: 4 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



Met Tyr Phe Leu lie Asn Phe Val 

1 5 
Leu Pro Phe Leu lie Asn Thr Gly 
20 

Ser Tyr lie Ser Leu Leu Val Phe 

35 40 
Phe Leu Asn His Val Leu Ala Leu 
50 55 



Tyr Pro Val Asp Met Val He Asn 

10 15 
Leu He Val Leu Leu Ser Ala He 
25 30 
Thr Lys Asp Ser He Phe Tyr Glu 
45 

Lys Asn Lys Phe Lys Lys Ser 
60 



(2) INFORMATION FOR SEQ ID NO: 435: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

Met Arg Val He Ala Trp Leu Glu Pro Ser Arg Lys He Leu Gin Pro 

15 10 15 

Gly Ala Asn Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu 

20 25 30 

Leu Leu Pro Tyr He Pro Pro Val Val Leu Asn Val Asp He Pro Asn 
35 40 45 
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Lys Arg Val Asp Val Glu He Leu Glu Gly Leu Asp Asp Glu Asp 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

Met Asp Asp Asp Phe Lys Arg Tyr Asn Ala Pro He Leu Thr Trp Tyr 

15 10 15 

Glu Thr Ala Arg Tyr Ala Phe Glu Arg Gly Met Val Trp Gin Asn Leu 

20 25 30 

Gly Gly Val Glu Asn Ser Leu Asn Gly Gly Leu Tyr His Phe Lys Glu 

35 40 45 

Lys Phe Asn Pro Thr He Glu Glu Tyr L' u Gly Glu Phe Thr Met Pro 

50 55 60 

Thr His Pro Leu Tyr Pro Leu Leu Arg Leu Ala Leu Asp Phe Arg Lys 
65 70 75 80 

Thr Leu Arg Lys Lys His Arg Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 

Met Glu Asp Phe Phe Ala Trp Cys Arg Arg Gin Ser Val Leu Ser Gly 

15 10 15 

Ser Lys Leu Gly Arg Ala He Glu Tyr Ser Leu Lys Tyr Glu Glu Thr 
20 25 30 
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Phe Lya Thr He Leu Lys Asp Gly His Leu Val Leu Ser Asn Asn Leu 

35 40 45 

Ala Glu Arg Ala He Lys Ser Leu Val Met Gly Arg Ser Lys Arg Val 

50 55 60 

Gin Trp Thr Leu Leu Ala 
65 70 



(2) INFORMATION FOR SEQ ID NO: 43 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:438: 



Met Thr Lys Gin Gly Ser Trp Lys Leu He Leu Glu Asp Glu Ser Ala 

15 10 15 

Gly Arg Glu Leu Phe Pro He Leu Thr Gin Gly Xaa Tyr He Ala Thr 

20 25 30 

Phe Asp Gin Gin Ala Pro Xaa He Asp Glu He Phe Lys Leu 

35 40 45 



(2) INFORMATION FOR SEQ ID NO: 439: 



(i) SEQUENCE CHARACTERISTICS: 
{h) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.439: 



Met Thr Thr Asn Xaa Asp Asn Asp Thr Ser Glu Arg Ser Asp Cys Tyr 

15 10 15 

Arg Lys Arg Thr Ala Gly Glu Thr Pro Met Asn Asp Leu Thr Ser His 

20 25 30 

Thr His Gly Gly Asn Tyr Thr He Ala Arg Tyr Gin Glu Glu Lys Phe 
35 40 45 
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Trp Asn Lys Gin Leu 
50 
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(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 

Met Ala Gin His Ala Ala Val Glu Ala Leu Thr Ala Gly Lys Asn Asp 

15 10 15 

Ala Glu Pro Met Lys Lys Glu Tyr He Gin Arg Arg Asp Tyr He He 

20 25 30 

Glu Lys Met Thr Ala Leu Gly Phe Glu He He Lys Pro Asp Gly Ala 

35 40 45 

Phe Tyr He Phe Ala Lys Jle Pro Ala Gly Tyr Asn Gin Asp Ser Phe 

50 jS 60 

Ala Phe Leu Lys Asp Phe Ala Gin Lys Lys Ala Val Ala Phe He Pro 
65 70 75 80 

Gly Ala Ala Phe Gly Arg Tyr Gly Glu Gly Tyr Val Arg Leu Ser Tyr 

85 90 95 

Ala Ala Ser Met Glu Thr lie Lys Glu Ala Met Lys Arg Leu Glu Glu 

100 105 no 

Tyr Met Arg Glu Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:441: 
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Met Asn lie Thr Asn Leu Phe Ser He Lys Thr Gly Cys Asp Glu Thr 

15 10 15 

Asp Arg Gin Leu Gin Lys Leu Phe Phe Gin Leu Asp Leu Gin Leu Gly 

20 25 30 

Glu Leu Thr Asp Gin Leu Arg Lys Leu Asp Ser Asn Phe Val Pro Arg 

35 40 45 

Ser Gin Phe Val Asp Thr Leu Asp Leu Asn Asp Val Glu Tyr Lys Glu 

50 55 60 

He Leu Asn Tyr Phe He Phe His Arg Asn Asp Ser Glu Glu Ser Leu 
65 70 75 80 

Val Glu Trp Leu Tyr Asp Trp He Ser Thr Asn Arg Tyr Glu Leu Pro 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 

Met Leu Gin Trp He Asp Arg Thr Phe Gin Asp Tyr Leu Glu Leu Glu 

15 10 15 

His Leu Val Leu Thr Thr Trp Ser Gly Asn Leu Gly Met Met Lys Leu 

20 25 30 

Ala Glu Lys Leu Xaa Met Lys Lys Xaa Ala His lie Xaa Lys Val Arg 

35 40 45 

Tyr Tyr Gin Gly Lys Tyr Phe Asp Ser He Lys Tyr Gly He Leu Arg 

50 55 60 

Xaa Asp Trp Glu Lys He Asn Asp Gly Tyr Tyr Gin He Tyr Xaa Asn 
65 70 75 80 

Ser 



(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

Met lie Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala 

15 10 15 

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp 

20 25 30 

Glu Gly lie Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr 

35 40 45 

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys lie lie Gin Thr 

50 55 60 

His Gly His Leu Phe Asp He Asn Phe Asn Phe Gin Lys Leu Asp Tyr 
65 70 75 80 

Trp Ala Gin Glu Glu Glu Ala Ala He Cys Leu Tyr Gly His Leu His 

85 90 95 

Val Pro Ser Ala Trp Leu Glu Gly Lys He Leu Phe Leu Asn Pro Gly 

100 105 110 

Ser He Ser Gin Pro Arg Gly Thr He Arg Glu Cys Leu Tyr Ala Arg 

115 120 125 

Val Glu He *sp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp 

130 135 140 

His Glu Val Tyr Pro Gly Cys Pro Arg Ser Leu Ala Asp Asp Ala Lys 
145 150 155 160 

Glu Leu Arg Leu Ser Val Gly Ala Glu Asn Phe 
165 170 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Met Asn Leu His Gin Pro Leu His Val Leu Pro Gly Val Gly Pro Lys 
15 10 is 
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Ser Ala Glu Lys Tyr Ala Lys Leu 
20 

Leu Leu Tyr Phe Pro Phe Arg Tyr 

35 40 
Leu Glu Leu Glu Asp Gly Glu Lys 

50 55 
Thr Pro Ala Ser Val Gin Tyr Tyr 
65 70 
Ala Phe Leu Val Leu Gin Ala Arg 
85 

Gly 
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Gly lie Glu Asn Leu Gin Asp Leu 
25 30 
Glu Asp Phe Lys Thr Lys Gin Val 
45 

Ala Val Leu Ser Gly Gin Val Val 
60 

Gly Phe Gin Ala Ala Asn Pro Pro 

75 80 
Glu Glu Gly Ser Phe Phe Phe Gly 
90 95 



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

Met Gin Leu Leu Ser Glu Asn Met Leu Lys Thr He Gin Ser Leu Ser 

15 10 15 

Val Trp Gin He Tyr Leu Leu Gly Phe Glu Arg He Leu Ala Leu Gly 

20 25 30 

Phe Gin Leu Leu Leu Thr Val Trp Val Tyr Gin Ala Val Arg Gin Lys 

35 40 45 

Lys Trp He Tyr Leu Leu Ala Ala Tyr Gly Leu His Ala Phe Phe Asp 

50 55 60 

Leu Ala Pro Ser Leu Phe Gin Val Gly Trp Leu Thr Asn Pro Val Leu 
65 70 75 80 

Val Glu Val He Leu Ala Leu Glu Leu Val Leu Val Ala Tyr Gly Thr 

85 90 95 

Lys Glu He Phe Cys Lys Lys Ser 
100 

(2) INFORMATION FOR SEQ ID NO:446: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 122 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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Ui) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

Met Thr Lys Ser Met Thr Pro Asp Arg Glu Val lie Thr Phe lie Pro 

15 10 15 

Glu Lys Phe lie Val Asp Gly Phe Gin Gly He Arg Asp Pro Arg Gly 

20 25 30 

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro 

35 40 45 

Arg Thr He Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val 

50 55 60 

Gin Val Glu Asn Val He He Ser Pro Leu Ala Met Val Gin Ser Val 
65 70 75 80 

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val He Asp Met Gly 

85 90 95 

Ala Gly Gin Thr Thr Val Ala Thr He Arg Asn Gin Glu Leu Gin Phe 

100 105 110 

Thr His He Leu Gin Glu Val Glu He Met 
115 120 

(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 

Met Leu Asn Ser He Val Thr He He Cys He Ala Leu lie Ala Phe 

15 10 15 

He Leu Phe Trp Phe Phe Lys Lys Pro Glu Lys Ser Gly Gin Lys Ala 

20 25 30 

Gin Gin Lys Asn Gly Tyr Gin Glu He Arg Val Glu Val Met Gly Gly 
35 40 45 
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Tyr Thr Pro Glu Leu lie Val Leu 

50 55 
Val Phe Asp Arg Lys Asp Pro Ser 
65 70 
Pro Asp Phe Gly Val His Ala Asn 
85 

Val Glu lie Thr Pro Glu Gin Ala 
100 

Met Asn Met Met His Gly Lys Met 
115 120 
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Lys Lys Ser Val Pro Ala Arg lie 
60 

Pro Cys Leu Asp Gin lie Val Phe 

75 80 
Leu Pro Met Gly Glu Glu Tyr Val 

90 95 
Gly Glu Phe Gly Phe Ala Cys Gly 
105 110 
He Val Glu 



(2) INFORMATION FOR SEQ ID NO: 448: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 



Met Lys He Val Ser Gly He Tyr Gly Gly Arg Pro Leu Lys Thr Leu 

15 10 15 

Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala He 

20 25 30 

Phe Asn Met He Gly Pro Tyr Phe Glu Val Gly Xaa Val Leu Asp Leu 

35 40 45 

Tyr Ala Gly Ser Gly Gly Leu Ser He Glu Ala Val Ser Arg Gly Met 

50 55 60 

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gin Thr He Val 
65 70 75 80 

Ala Glu Asn He Gin Met Thr Lys Glu Val Gly Lys Phe Gin Leu Leu 

85 '90 95 

Gin Asn Gly Cys Arg Lys Gly He Gly Thr Gly Xaa Leu Gly Asn Leu 

100 105 110 

Thr Ser Phe Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 449: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 52 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Met Xaa Thr Asn Phe Thr Asp Gin Val Asp Thr Met He Tyr Val Asp 

15 10 15 

Lys Glu Glu Lys Glu Thr He Lys Ala Ala Leu Val Glu Phe Phe Asn 

20 25 30 

Gly Lys Val Thr Leu Thr Asp Gin Gly Leu Arg Glu Val Glu Val Pro 

35 40 45 

Val Asn Leu Val 
50 

(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

Met He Gly Val Ala Leu Glu Glu Ser Pro Thr Leu Lys He His His 

15 10 15 

Leu Thr His Asp He Thr Pro Tyr Asn He Phe Glu Gly Ser Tyr Arg 

20 25 30 

Leu Phe Gin Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser 

35 40 45 

Val Val Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys 

50 55 60 

Thr Ala Gin Lys Ser He His Cys His Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:451: 

Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp 

1 5 10 15 

Met Met Thr Glu Phe Glu Lys Ser Gin Ser Ala His Asp Gly Gly Phe 

20 25 30 

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gin 

35 40 45 

Glu Gin Glu Met Gly He Asn Leu Pro Glu Gly Trp Val Pro Ala He 

50 55 60 

Gin Leu Val Ala Phe Ser Glu Lys Gly Gin Ala Val Gly Phe Leu Asn 
65 70 75 80 

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His He 

85 90 95 

Gly Tyr Ser He Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu 

100 10? " no 

Leu Ser Val Arg Ala Cys Lys Leu Lea Arg Lys Arg Thr Ser Arg Lys 
115 120 125 

Leu Trp 
130 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 

Met Thr Phe He Arg Gly Pro He He Ser Thr Val Gly Glu Gly Val 

15 10 15 

Xaa He Leu Ala Thr Val Asn Xaa Gin Phe Val Xaa Xaa Gin Xaa Xaa 
20 25 30 
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Asn Met Leu Val Ser Pro Phe His Pro Glu Leu Thr Asp Xaa Xaa Arg 

35 40 45 

Leu Xaa Gin Tyr Phe lie Ser Met Cys Lys Glu Lys Ser Xaa Asp 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 453: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 

Met Lys Gly Asp Val He Thr Glu His Gin Phe Tyr Glu Gin Val Lys 

15 10 15 

Asn Asn Pro Ser Ala Gin Xaa Val Leu Leu Asn Met Thr He Gin Lys 

20 25 30 

Val Phe Glu Lys Thr He Trp Leu Arg Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

fC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:454: 

Met Asp Lys Ala Leu Ala Asp Leu Lys Thr Ser Gly His Leu Pro He 

1 5 10 15 

Pro Arg His Leu Arg Asp Gly His Tyr Xaa Gly Ser Lys Glu Leu Gly 

20 25 30 

Asn Ala Gin Asp Tyr Leu Tyr Pro His Asn Tyr Pro Xaa Asn Trp Val 

35 40 45 

Lys Gin Asp Tyr Leu Pro Gin Lys lie Arg Asn His His Tyr Phe Gin 
50 55 60 
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Ala Glu Tyr Thr Gly Lys Tyr Glu Arg Ala Leu Ala Gin Arg Lys Glu 
65 70 75 80 

Ala lie Asp His Leu Arg Lys He 
85 



(2) INFORMATION FOR SEQ ID NO: 455: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 



Met Val Val Phe Thr Gly Ser Thr 

1 5 
Leu Lys Glu Leu Asp He Pro Arg 
20 

Ser Arg Glu Lys Lys G" y Phe Leu 

35 40 
Gin Val Asp He Glu Ala He Ser 

50 55 
Gin Gin Val Val Lys Gly Val Pro 
65 70 
Pro Val Lys Thr Val Ser Val Arg 
85 



Val Glu Glu Ala He Gin Lys Gly 

10 15 
Met Lys Ala His lie Lys Val He 
25 30 
Gly Leu Phe Gly Lys Lys Pro Ala 
45 

Glu Thr Thr Val Val Lys Ala Asn 
60 

Lys Lys He Asn Asp Leu Asn Glu 
75 80 

Asn Arg 

90 



(2) INFORMATION FOR SEQ ID NO: 4 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 : 



Met Thr Lys Arg Cys Ser Trp Val Lys Met Thr Asn Pro Leu Tyr He 
15 10 15 
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Ala Tyr His Asp Glu Glu Tip Gly 
20 

Leu Phe Glu Leu Leu Cys Met Glu 

35 40 
Glu Thr Val Leu Asn Lys Arg Gin 
50 55 
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Gin Pro Leu His Asp Asp Gin Ala 
25 30 
Thr Tyr Gin Ala Gly Leu Ser Trp 
45 

Ala Phe Arg Glu Ala Phe Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 457: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 



Met Phe Val Gly Gin Glu Trp Thr 

1 5 
Gly Asn His Gin Gly Gin Val Thr 
20 

Phe Pro Val Ser Ala Arg Ser Val 
35 40 



Asn Gin Thr Phe Val Asp Leu Leu 

10 15 
lie Asp Glu Glu Gly Tyr Gly Gin 
25 30 
Ser Val Trp Ala Val Asn Thr He 
45 



(2) INFORMATION FOR SEQ ID NO: 45 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



Met Ala Gin Val Asp He He His 

1 5 
Glu Gly Val Thr Lys Tyr Leu Gin 
20 

Gly Tyr Ala Pro Gin Asn Ala Gly 
35 40 



Gly He Gly Thr Gly Val He Arg 

10 is 
Arg Asn Lys His Val Lys Ser Phe 
25 30 
Gly Ser Gly Ala Thr He Val Thr 
45 
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Phe Lys Gly 
50 



PCT/US97/O5306 



(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 



Met Asn Leu Leu Glu Glu Ala Thr Lys Gly Lys Val lie Phe Glu Gly 

15 10 15 

Val Asp lie Thr Asp Lys Lys Asn Asp Leu Phe Pro Met Arg Glu Lys 

20 25 30 

Met Gly Met Val Phe Gin Gin Phe Asn Xaa Phe Leu He 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 460: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 60: 



Met Gin He Asp Leu Ala Asn Leu 

1 5 
Ser Gly He Asn Pro Asn Arg Tyr 
20 

Pro He Leu Asp Arg His Met Thr 

35 40 
Ala Gin Ala Ala Gly Thr He Ala 

50 55 
Tyr Ala Tyr Leu Phe Glu Thr Gin 
65 70 



Leu Pro Asp Leu Pro Gly Asn Leu 

10 15 
Val Phe Tyr Gin Asp Val Leu Cys 
25 30 
Pro Glu Gin Asp Lys Pro His Phe 
45 

Asp lie Lys Glu Lys Ala Gly Asn 
60 

Ala Gin Leu Asn Ala He Leu Ser 
75 80 
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Ser Lys Val Asp Val Gly Arg Arg He Arg His Ala Tyr Gin Ala Asp 

85 90 95 

Asp Lys Glu Ser Leu Gin Gin He Ala Arg Gin Glu Leu Pro Glu Leu 

100 105 110 

Arg Ser Gin He Glu Asp Phe His Ala Leu Phe Ser His His Trp Leu 
115 120 125 

Lys 



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(Dj TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 

Met Arg Gin Gly He Asp Ser Leu Ala Tyr Val Val Lys Thr His Phe 

1 5 10 15 

Glu Leu Asp Pro Phe Ser Gly Gin Ala Phe Leu Phe Cys Gly Gly Arg 

20 25 30 

Lys Asp Arg Phe Lys Ala Leu Tyr Trp Asp Gly Gin Gly Phe Trp Leu 

35 40 45 

Leu Tyr Lys Arg Phe Glu Asn Gly Lys Leu Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO:462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 : 

Met Asn Ala Phe Leu Glu Glu Leu Ser Gin Ala Tyr Pro Asp Asp Tyr 
1 5 10 15 
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Phe Leu Leu Val Met 
20 

Lys lie Pro Thr Asn 
35 

Glu Met Asn Pro lie 
50 

Phe Lys Asn Lys Ala 
65 

Gin Asp Val He Gin 
85 

Asn Arg Arg Trp Thr 
100 



Asp Asn Ala He Trp His 
25 

He Gly Phe Ala Phe He 
40 

Glu Gin Val Trp Lys Glu 
55 

Phe Arg He Leu Glu Asp 
70 75 
Gly Leu Glu Lys Glu Val 
90 

Arg Met Leu Phe Glu Ser 
105 
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Lys Ser Ser Val Leu 
30 

Pro Pro Tyr Thr Pro 
45 

lie Arg Lys Arg Gly 
60 

Val Met Asn Gin Leu 
80 

He Lys Ser He Val . 
95 

Arg 



(2) INFORMATION FOR SEQ ID NO:463: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 



Met Thr Val Glu Glu Glu Lys Val 

1 5 
Thr Glu Ala Gly Glu Phe Val Thr 
20 

Lys Lys Glu Leu Gly Arg Ser Tyr 

35 40 
Leu Lys Arg His Gly Trp Arg Asn 

50 55 
Lys Lys Ala Asp Ala Gin Thr He 
65 70 
He Gin Glu Asp Lys Lys Ala lie 
85 



Phe Leu Ala Arg His Leu Lys Ala 

10 15 
He Asp Ala Leu Phe Gin Ala Tyr 
25 30 
Thr Arg Asp Ala Phe Tyr Gin Leu 
45 

lie Met Pro Arg Pro Glu His Pro 
60 

Val Ala Ser Lys Asn Lys He Ser 
75 80 



(2) INFORMATION FOR SEQ ID NO:464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

336 



WO 97/37026 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

Met Leu Ser Leu Val Val Leu lie Thr Gin Tyr Ala Thr His Gly Glu 

15 10 15 

Leu Phe Glu Asn Tyr lie Ser Lys Glu Asn Asp Asn Val lie Lys Glu 

20 25 30 

Tyr Gin Asn Thr Thr Gly Phe Ser Thr Pro Tyr Thr Leu Asp Gly Ser 

35 40 45 

Val Leu lie Val His Pro Asp Leu Thr Lys Gly Met Asn lie Glu Gly 

50 55 60 

Tyr Ser Asp Leu Leu Lys Pro Glu Leu Lys Gly Lys He Ala Thr Ala 
65 70 75 80 

Asp Pro Ala Asn Ser Ser Ser Ala Phe Ala Gin Leu Thr Asn Met Leu 

85 90 95 

Gin Ala Gin Gly Gly Tyr Lys Asp Asp Leu Ala Trp Ser Val Leu Ser 

100 105 110 

Thr He Phe Ser Thr Leu He Asp Gly He Val Lys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 4 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465: 

Met Leu Ala Arg Lys Asp Arg Phe Tyr Tyr He Asp Lys Ala Thr Gly 

15 10 15 

Lys Ala Leu Gly He Phe Leu Ser Cys Val Leu Thr Ser Asn Gly Val 

20 25 30 

He Glu Val Gly Ala Val Thr Phe Ser Pro Lys Leu Arg Gly Thr Arg 
35 40 45 
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lie Gly Thr Glu Ala Gin Tyr Leu Leu Ala Arg Tyr Val Phe Glu Glu 

50 55 60 

Leu Asn Tyr Arg Arg Tyr Glu Trp Lys Cys Asp Ala Leu Asn Leu Pro 
65 70 75 80 

Ser Arg Arg Ala Ala Glu Arg Leu Gly Phe lie Tyr Glu Gly Thr Phe 

85 90 95 

Arg Gin Ala Val Val Tyr Lys Gly Arg Thr Arg Asp Met Asp Trp Leu 

100 105 110 

Ser Met lie Asp Lys Asp Trp Pro Lys Val Lys Asp Arg Leu Glu Thr 

115 120 125 

Trp Leu Arg Pro Glu Asn Phe Asp Lys Asn Gly Gin Gin Tyr Lys Ser 

130 135 140 

Leu Arg Glu Leu 
145 



(2) INFORMATION FOR SEQ ID NO: 466 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466: 



Met Met Thr Ser Asp Phe Phe Glu Val Trp Phe Gin Lys Phe Leu Leu 

15 10 15 

Pro Thr Leu Thr Thr Pro Ser Val lie lie Val Lys 



(2) INFORMATION FOR SEQ ID NO: 467: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 
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Met Val Ser Thr Lys Thr Gin lie Ala Gly Phe Glu Phe Asp Asn Cys 

15 10 15 

Leu Met Asn Ala Ala Gly Val Ala Cys Met Thr He Glu Glu Leu Glu 

20 25 30 

Glu Val Lys Asn Ser Ala Ala Gly Thr Phe Val Thr Lys Thr Ala Thr 

35 40 45 

Leu Asp Phe Arg Gin Gly Asn Pro Glu Pro Arg Tyr Gin Asp Val Pro 

50 55 60 

Leu Gly Ser He Asn Ser Met Gly Leu Pro Asn Asn Gly Leu Asp Tyr 
65 70 75 80 

Tyr Leu Asp Tyr Leu Leu Asp Leu Gin Glu Lys Glu Ser Asn Arg Thr 

85 90 95 

Phe Phe Leu Ser Leu Val Gly Met Ser Pro Glu Lys Pro He Leu Phe 
100 105 110 

(2) INFORMATION FOR SEQ ID NO:468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 

Met Thr Tyr Leu Met Phe Ser Gly Leu Asp Glu Asn Phe Tyr His Phe 

15 10 15 

Pro Trp Glu Val Phe Ala Gly Phe Gly He He Phe Leu Ala Cys Pro 
20 25 30 

Glu Lys Val 
35 

(2) INFORMATION FOR SEQ ID NO: 469: 

<i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Met Phe Thr Gly Trp Phe Ala Gin 

1 5 
Gly Ala Trp Gly Leu Gly Phe Met 
20 

Lys Glu Leu Leu Glu Gly Glu Tyr 
35 40 



Thr Phe Leu His Gin Phe lie Arg 

10 15 
lie Phe He Ala Phe Pro Met Gly 
25 30 
His Glu His Asp 



(2) INFORMATION FOR SEQ ID NO: 470: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 



Met Gin Glu Lys He Leu Val Thr Gly Gly Ala Gly Phe He Gly Thr 

15 10 15 

His Thr Val He Glu Leu lie Gin Ala Gly His Gin Val Val Val Val 

20 25 30 

Asp Asn Leu Val Asn Ser Asn Xaa Lys Ser Leu Glu Val Val Glu Arg 

35 40 45 

He Thr Gly Val Glu He Pro Phe Tyr Glu Ala Asp He Xaa Asp Thr 

50 55 60 

Asp Thr Leu Arg Asp He Phe Lys Gin Glu Glu Leu Thr Gly Val He 
65 70 75 80 

His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Thr Arg He Pro Leu 

85 90 95 

Ala Tyr Tyr Asp Asn Asn He Ala Gly Thr Val Ser Leu Leu Lys Ala 

100 105 110 

Met Glu Glu Asn Asn Cys Lys Asn He He Phe Ser Ser Xaa Ala Thr 

115 120 125 

Val Tyr Gly Asp Pro Tyr Thr Val Pro He Leu Glu Val Ser His Phe 
130 135 140 

Gin 
145 



(2) INFORMATION FOR SEQ ID NO: 471: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:471: 

Met Val Lys Glu Lys Val Thr Leu Thr Pro Leu Ala Ser Gly Tyr Gin 

15 10 15 

lie Gly Glu Glu Glu Phe Glu Gin Val He Leu Ala Xaa Gly Ala Trp 

20 25 30 

Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr Glu Val Asp Val Arg Pro 

35 40 45 

Gin Lys Gly Gin Leu Arg Asp Tyr Gin Leu Val Gin Asp Met Glu Ala 

50 55 60 

Tyr Pro Val Val Met Pro Glu Gly Glu Trp Asp Leu He Pro Phe Ala 
65 70 75 80 

Gly Gly Lys Leu Ser Leu Gly Ala Thr His Glu Asn Asp Met Gly Phe 

85 90 95 

Asp Leu Thr Val 2 ,p Glu Thr Leu Leu Gin Gin Met Glu Glu Ala Thr 

100 105 110 

Leu Thr His Tyr Leu He Leu Ala Glu Ala Thr Ser Lys Ser Glu Arg 

115 120 125 

Val Gly He Arg Ala Tyr Thr Ser Asp Phe Ser Pro Phe Phe Gly Gin 

130 135 140 

Val Pro Asp Leu Thr Gly Val Tyr Ala Ala Ser Gly Leu Gly Ser Ser 
145 150 155 160 

Gly Leu Thr Thr Gly Pro He He Gly Tyr His Leu Ala Gin Leu He 

165 170 175 

Gin Asp Lys Glu Leu Thr Leu Asp Pro Glu Asn Tyr Pro He Glu Asn 

180 185 190 

Tyr Val Lys Arg Val Lys Ser Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

Met Glu His lie lie Tyr Gin Phe Glu Glu Asp Leu Ala lie Leu Thr 

15 10 15 

Leu Asn Arg Pro Glu Val Ala Asn Gly Phe His He Pro Met Cys Glu 

20 25 30 

Glu He Leu Glu Ala Leu Thr Leu Ala Glu Glu Asn Pro Ala Val His 

35 40 45 

Phe He Leu He Asn Ala Asn Gly Lys Val Phe Ser Val Gly Gly Asp 

50 55 60 

Leu Val Glu Met Lys Arg Ala Val Asp Glu Asp Asp He Pro Ser Leu 
65 70 75 80 

Thr Lys He Ala Glu Leu Val Asn Thr lie Ser Tyr Lys He Lys Gin 

85 90 95 

He Ala Lys Pro Val Phe Asn Gly Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

Met Gin Pro Glu Val He Trp Leu Gin Lys He Ala Lys Pro Ser Gin 

15 10 15 

Met Lys Val Leu Asn Thr Thr Asp Arg Val Gin Ala He Lys Asp Asp 

20 25 30 

Val Asp He He Gin Asn Ser Leu Gin He He Asn Gin Gin Lys Glu 

35 40 45 

Leu He Lys Glu Tyr Gin Glu Asp Leu Thr Tyr Lys Phe Lys Val Leu 

50 55 60 

Glu Lys Asp He Gin Thr Arg Thr Ser Cys Asp Lys Arg Asn Ala Gly 
65 70 75 80 

Asn 



342 



WO 97/37026 



PCT/US97/05306 



(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 

Met Gly Ser Asn Ser Leu Thr Leu Leu Leu Lys lie Gin Ala Thr lie 

15 10 15 

Pro Arg Asn lie Tyr Asp Asn Leu Gin Ala Gly Ser Lys Val Thr Leu 

20 25 30 

Asp Arg Ala Tyr Gly His Met lie lie Glu Glu Gly Arg Glu Asn Gin 

35 40 45 

Val Trp lie Ala Gly Gly He Gly He Thr Pro Phe He Ser Tyr He 

50 55 60 

Arg riu His Pro lie Leu Asp Lys Gin Val His Phe Tyr Tyr Ser Phe 
65 70 75 80 

Arg Gly Asp Glu Asn Ala Val Tyr Leu Asp Leu Leu Arg Asn Tyr Ala 

85 90 95 

Gin Lys Asn Pro Asn Phe Glu Leu His Leu He Asp Ser Thr Lys Asp 

100 105 110 

Gly Tyr Leu Asn Phe Glu Gin Lys Lys Cys Pro Asn Met Gin Pro Ser 

115 120 125 

He Cys Val Val Leu Phe Leu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 
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Met Phe Phe Val Leu Ala 

1 5 
Lys Leu His Pro lie Gin 
20 

Lys Asp Val Asp Phe Ala 
35 

Asp Gin Asp Phe Leu Glu 
50 

Gin Ala Ser Lys Val lie 
65 70 
Gin lie Leu Ser Asn Pro 
85 

Glu Met Ala Val Arg He 
100 

Glu Glu Leu Val Glu Leu 
115 

Val Arg Arg Leu Leu Thr 
130 

Leu Pro Glu Ala He His 
145 150 
His Leu Lys Ser Leu Lys 
165 

Lys Glu Pro Trp Asp Ala 
180 

Leu Gly Lys Pro Ser He 

195 200 
Arg He Glu Thr Asn 
210 
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Val Ala Gly Arg Asn He 
15 

Gly Tyr His Ser Val Asn 
30 

Leu Arg Gin His Gin Lys 
45 

Ser Val Ala Leu Phe Glu 
60 

Phe Pro Leu Leu Arg Tyr 
75 80 
He Tyr Gin Val Asn Gin 
95 

Lys Thr Ala Gin Ser Val 
110 

Arg Tyr Thr Lys Ala Arg 
125 

Gin Ala Arg Glu Ser Asp 
140 

Thr Glu Lys Gly Arg Gin 
155 160 
Leu Val Ser Arg He Gly 
175 

Val Asp Gin l^e Tyr Gin 
190 

Phe Gly Arg Val Pro He 
205 



Tyr Ala Lys Ala 
10 

Arg Gin Gly Ala 
25 

Ser Ala Thr Ala 
40 

Arg Phe Met Pro 
55 

Trp Glu Asp Tyr 

Asp Leu Thr Thr 
90 

Lys Glu Ala He 
105 

Val Thr Thr Lys 
120 

Tyr He Leu Met 
135 

Val Leu Gly Phe 

Gly Gin Val Asn 
170 

Met Thr Gin Lys 
185 

Ala Glu Gin Asn 



(2) INFORMATION FOR SEQ ID NO: 47 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

Met Leu Gly Ser Met Phe Val Gly Leu Leu Val Gly Phe Leu Ala Gly 
15 10 15 
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Ala Met Thr Asn Arg Gly Glu Arg 
20 

Leu Gly Trp lie Gly Ala Phe Leu 

35 40 
Gly Pro Val Leu Ser Gly Thr Ala 

50 55 
Met He Val Leu Ala He Phe Trp 
65 70 
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Met Gly Cys Phe Gly Lys Met Phe 
25 30 
Gly His Leu Leu Phe Gly Thr Trp 
45 

He He Pro Ala Val Leu Gly Ala 
60 

Arg Arg Gly Ser 
75 



(2) INFORMATION FOR SEQ ID NO: 477: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



Met Arg Ser Ser Pro Ser Arg Glu 

1 5 
Thr Thr Val Tyr Val Thr His Asp 
20 

Asp Gin He Ala Cys Tyr Glu Arg 

35 40 
Thr Lys Arg Thr Val Ser 
50 



He Gin Pro Arg Ser Trp Gly lie 

10 15 
Gin Glu Gly Ala Met Ala lie Ser 
25 30 
Trp Gly Asp Pro Thr Asn Arg Pro 
45 
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WHAT IS CLAIMED IS: 
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1 . An isolated polynucleotide comprising a polynucleotide sequence selected from 
the group consisting of: 

5 (a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a 

polypeptide comprising an amino acid sequence of selected from the group consisting of SEQ ID 
NO:224 through 477, 

(b) a polynucleotide which is complementary to the polynucleotide of (a); 

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the 
1 0 same mature polypeptide expressed by the a gene comprising a polynucleotide sequence selected 

from the group consisting of SEQ ID NO: 1 through 223 contained in the DNA of the deposited 
• clone; and 

(d) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide 
of (a), (b)or(c). 

IS 2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising r' e nucleic acid sequence set forth in 
SEQ ID NO: 1. 

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from 
20 the group consisting of SEQ ID NO: 1 through 223. 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:224 through 477. 

7. A vector comprising the polynucleotide of Claim 1 . 

8. A host cell comprising the vector of Claim 7. 

25 9. A process for producing a polypeptide comprising: expressing from the host 

cell of Claim 8 a polypeptide encoded by said DNA. 

10. A process for producing a noval polypeptide or fragment comprising 
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide 
or fragment. 

30 11. A polypeptide comprising an amino acid sequence which is at least 70% 

identical to an amino acid sequence selected from the group consisting of SEQ ID NO:224 
through 477. 

12. A polypeptide comprising an amino acid selected from the group consisting of 
SEQ ID NO:224 through 477. 
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13. An antibody against the polypeptide of claim 1 1 . 

14. An antagonist which inhibits the activity or expression of the polypeptide of 
claim 11. 

15. A method for the treatment of an individual in need of a Streptococcal 

5 polypeptide comprising: administering to the individual a therapeutically effective amount of the 
polypeptide of claim 1 1 . 

16. A method for the treatment of an individual having need to inhibit a bacterial 
polypeptide comprising: administering to the individual a therapeutically effective amount of the 
antagonist of Claim 14. 

10 17. A process for diagnosing a disease related to expression or activity of the 

polypeptide of claim 1 1 in an individual comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

(b) analyzing for the presence or amount of said polypeptide in a sample derived from 
the individual. 

15 18. A method for identifying compounds which interact with and inhibit or activate 

an activity of the polypeptide of claim 1 1 comprising: 

contacting a composition comprising the polypeptide with the compound to be screened 

under conditions to permit interaction between the compound and the polypeptide to assess the 

interaction of a compound, such interaction being associated with a second component capable of 
20 providing a detectable signal in response to the interaction of the polypeptide with the 

compound; 

and determining whether the compound interacts with and activates or inhibits an 
activity of the polypetide by detecting the presence or absence of a signal generated from the 
interaction of the compound with the polypeptide. 

25 19 - A method for inducing an immunological response in a mammal which 

comprises inoculating the mammal with Streptococcal polypeptide of claim 1 1, or a fragment 
or variant thereof, adequate to produce antibody and/or T cell immune response to protect 
said animal from disease. 

20. A method of inducing immunological response in a mammal which comprises 

30 delivering a nucleic acid vector to direct expression of Streptococcal polypeptide of claim 1 1 , 
or fragment or a variant thereof, for expressing said Streptococcal polypeptide, or a fragment 
or a variant thereof in vivo in order to induce an immunological response to produce 
antibody and/ or T cell immune response to protect said animal from disease. 
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21 . An isolated Streptococcal polypeptide having one of the amino acid 
sequences of SEQ ID NO:224 through SEQ ID NO:234. 

22. An isolated nucleic acid encoding one of the amino acid sequences of 
Claim 21 and nucleic acid sequences capable of hybridizing therewith under string nt 

5 conditions. 

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting 
candidate compounds with a polypeptide of Claim 21 and selecting those compounds 

10 capable of inhibiting the bioactivity of said polypeptide. 

25. Antimicrobial compounds identified by the method of Claim 24. 



348 



INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/US97/05306 


A. CLASSIFICATION OF SUBJECT MATTER 




IPC(6) Pkasc See Extra Sheet. 




US CL :536723.7; 435/6, 69.1, 252.3, 320.1; 530/324, 350; 514/12 




According to International Patent Classification (IPC) or to both national classification an** IPC 


B. FIELDS SEARCHED 



Minimum documentation searched (clarification system followed by classification symbols) 
U.S. : 536/23.7; 435/6, 69.1. 252.3. 320.1; 530/324, 350; 514/12 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable,, search terms used) 
Please See Extra Sheet. 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant 10 ebim No. 


X, P 
Y f P 

Y 
Y 


WO 96/10647 A1 (FUSO PHARMACEUTICAL INDUSTRIES, 
LTD.) 11 April 1996, claim 2, pages 22-24. 

WATSON, J. D. et al. Recombinant DNA, Second Edition. 
New York: Scientific American Books, W.H. Freeman and 
Company, 1992, pages 73 and 74, see entire document. 

KENNELL, D. E. Principles and Practices of Nucleic Acid 
Hybridization. Progress in Nucleic Acid Research Molecular 
Biology. 1971 , Vol. 1 1 , pages 259-301 , see especially pages 
259-262. 


1, 2, 7, 8, and 
22 

3, 6, 9-12. 15, 
17, 21, and 23- 
25 

1-3, 6-12, 15, 
17. and 21-23 

1-3, 6-12, 15, 
17, and 21-23 


| | Purther documrnti are listed in the continuation of Box C. | j See patent family annex. 


. , ^ ^ . . . . 4*>**maimcamI^*gk1to*MAk**m*+c***>\\ti\m\ i ISm 

*L" timnit whidfc way *wjr<^> o» priotfr chkaft) or whiA » iteStdoaMliiMkaikM 

| m wt t i 4' \ | j oandarari to ■votvt as ■vnhn ikv vfcs Sto 4op^bmC ii 
mmm U«ff ofcvm la a penon akiUMl is «w ait 

•r < ■ hum »■ p Mfc IwainJ p ri.» to Sk* i in 1 1 ■ ■ 1 filj^ d—e bia smw a^— inf — ■ l -„,,i„, , f< ■ ■ r i 
g»pnoftyd*B«ckud annawi ■mHii i of lie «me patcal fiati* 


Date of the actual completion of the international search 
10 JULY 1997 


Date of mailing of the international search report 

0 6 AUG 1997 


Name and mailing address of the ISA/US 
Commiuio— r ofParaaii aad Trademarks 

Box PCT 

W«ilu«fioo, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer , J 
Telephone No. (703) 308-0196 ! 



Form PCT/1S A/210 (second sheetXJury 1992)* 



INTERNATIONAL SEARCH REPORT 



International sppliration No. 
PCT/US97/05306 



Box I Obsertatms where eertaja claims were round unsearchable (Coaiiauatioa of ken 1 of Tint sheet) 
Thai internsiional raport ha* not been established in respect of oertain claims under Article 17(2X») for the following reasons: 

1. rn Claims Nos.: 

^— ' because they relate to subject matter not required to be searched by this Authority, namely: 

2. I | CUimi Nos.: 

*— * because they relate to parts of the imenuUioasi application that do oot comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 

3. Q Claims Nos.: 

because they are dependent claims and are not drafted m accordance with the second and third s entence s of Rule 6.4(a). 

Box II Obserratioaf where nasty of isjTeatkw is Lacking (Coatiauaiion of kern 2 of first sheet) 
This Internationa] Searching Authority fo Ad multiple inventions in this international application, as follows: 
Please See Extra Shoot. 



1. I ] As all required additional search fees were timely paid by the applicant, this utternational search report covers all searchable 

claims. 

2. [^] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any arktirionsl fee. 

3. | | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. | x| No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-3, 6-12, 15. 17, and 21-23 (SEQ ID NOs 224-233). 



Remark oa Protest | [ The additional search fees were accompanied by the applicant's protest. 

[ 1 No protest accompanied the payment of additional search fees. 



Form PCT/IS A/210 (coittmuation of first shcet<l)XJuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International Application No. 
PCT/US97/05306 



BOX U. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING (Continued): 

Thii application oonuins the following inventions or groups of invention! which are r** so linked at to form a tingle 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search 
foes must be paid. 

Croup 1. Claims 1-12, 15, 17, and 21*23, drawn to nucleic acids and polypeptides of specific sequence, vectors 
containing nucleic acids, methods of producing polypeptides, and methods of treatment using polypeptides. 

Group II. Claims 13 and 19, drawn to antibodies against any of no fewer than 254 specific seque n ce polypeptides. 

Group ID. Claims 14 and 16, drawn to antagonists of undisclosed composition of the activity of no fewer than 254 
specific sequence polypeptides. 

Group IV. Claims 18 and 24, drawn to methods for screening for compounds that interact with no fewer than 254 
specific sequence polypeptides. 

Group V. Claim 20, drawn to a method of inducing an immunological response using a nucleic acid. 
Group VI. Claim 25, drawn to compounds of undisclosed composition. 

The inventions listed as Groups I-VI do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the aame or corresponding special technical features lor the following reasons. The nucleic 
acids, proteins, and methods of Group I are materially different from the antibodies of Group II, the compounds of 
Group HI, and the compounds of Group VI and are not required to produce cither the antibodies of Group II, which 
antibodies may be made with antigens other than those embraced by Group I. Additionally, the methods of Gr jp I arc 
not needed to produce the compounds of either one of Groups III or VI. The compositions and methods of G ap I are 
not needed for the methods of cither one of Groups IV or V. The antibodies of Group II are not needed for the 
methods of Groups IV or V and are materially different from the compounds of Groups III and VI. The antagonists of 
Group in are materially different from the compounds of Group VI because the antagonists and compounds have 
differing sites of action and/or binding. Neither are the compounds of cither one of Groups III or VI needed to practice 
the method* of either one of Groups IV or V. The methods of Groups IV and V may be practiced mdependently of one 
another. 

This application contains claims directed to more than one species of the generic invention. These species are deemed 
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1. 
In order for more than one apeeies to be searched, the appropriate additional search fees must be paid. The species are 
as follows . Each of the 234 SEQ ID NOi listed is a separate species because there is no relationship between the 
species. 

The species listed above do not relate to a single inventive concept tinder PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the aame or corresponding special trrhnirel features for the following reasons. There is no 
relationship among the species. 

For any Group applicant elects, a total of 10 (TEN) specified sequences will be searched and no more than 4 (POUR) 
specified sequences will be searched for each additional fee paid. 



The claims arc deemed to correspond to the species listed above in the following manner: SEQ ID NOs 224-233 
correspond to claims 1-3, 6-12, 15, 17, and 21-23. 

A. CLASSIFICATION OP SUBJECT MATTER: 
IPC (6): 

C12N 15/31. 15/11, 15/00; C12P 21/02; A61K 38/16. 39/09; C07K 14/195. 14tfl5 
B FIELDS SEARCHED 

F l nolm n ic data bases consulted (Name of data base and where practicable terms used): 

MPSRCH OF: A-GENESEQ26, A-1SSUED, HIV-AA8, PIR50, SUIWW-PROT34, EMBL-NEWll, GENBANK97, 
GENBANK-NEW1 1 , U-EMBL48.97, HIV-NA8, N-GENESEQ26, N-ISSUED. EST-STS, EST-STS-TWO, EST-STS- 



Fonn PCT/IS A/210 (extra sheetXJuly 1992)+ 



INTERNATIONAL SEARCH REPORf 



Internatioml tppticatkm No. 
PCT/US97/05306 



THREE. SEQ ID NOt 224-233. 



Pom PCT/1S A/210 (cxtim abcctXJuly 1992)* 



